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PREFACE 


This volume has a threefold purpose: to explain the physical concepts 
of quantum mechanics, to describe the mathematical formalism, and to 
present illustrative examples of both the ideas and the methods. The 
book is intended to serve as a text at the graduate level and also as a 
reference book. It is assumed that the reader is reasonably familiar 
with atomic structure, classical mechanics, and differential equations. 
In addition, he should have had some contact with electromagnetic theory 
and, for the latter part of the book, with the special theory of relativity. 

The author believes that the analytical methods employed in the book 
will satisfy most theoretical physicists even though no attempt is made 
to achieve mathematical rigor. For example, there is little or no dis¬ 
cussion of the justification for the interchange of sum, derivative, and 
integral operations, or for the use of the 8 function. On the other hand, 
the physical reasons for the nature of the results obtained are investigated 
wherever possible. 

Problems are given at the end of each chapter. They are often used 
to illustrate or amplify points discussed in the text. Original theoretical 
papers are referred to throughout the book; the list is representative 
rather than exhaustive. Experimental results are, for the most part, 
quoted without reference, since the large amount of documentation 
required for an adequate survey seems out of place in a book on theoretical 
physics. Several other books on quantum mechanics and related sub¬ 
jects are referred to for more detailed discussions of particular topics. 

The scope of this volume is best outlined if the book is divided into 
three parts. The first three chapters constitute an introduction to 
quantum mechanics, in which the physical concepts arc discussed and 
the Schrodinger wave formalism is established. The detailed treatment 
of the wave function (Chap. Ill) may be omitted in a first reading. The 
next eight chapters comprise the central part of the book. This part 
presents exact solutions of the wave equation for both energy-level and 
collision problems, the Heisenberg matrix formalism and transformation 
theory, approximation methods, radiation theory, and some applications 
to atomic systems. Since the first eleven chapters correspond to a 
typical one-year graduate course, it seemed desirable to include a semi- 
classical treatment of electromagnetic radiation in the central part of 
the book (Chap. X) even though some of the results arc obtained again in 
Chap. XIV. The last part of the book corresponds to a short course in 
what is often called advanced quantum mechanics. It- consists of rela- 
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.cle theory and an introduction to quantized field theory and 
xectrodynamics. 

all this book was written while the author was at the Uni- 
01 Pennsylvania, and he gratefully acknowledges the continued 
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CHAPTER I 


THE PHYSICAL BASIS OF QUANTUM MECHANICS 

At the present stage of human knowledge, quantum mechanics can 
be regarded as the fundamental theory of atomic phenomena. The 
experimental data on which it is based are derived from physical events 
that lie almost entirely beyond the range of direct human perception. 
It is not surprising, therefore, that the theory embodies physical con¬ 
cepts that are foreign to common daily experience. These concepts did 
not appear in the historical development of quantum mechanics, how¬ 
ever, until a quite complete mathematical formalism had been evolved. 
The need for quantitative comparison with observation, which is the 
ultimate test of any physical theory, in this case led first to the formalism 
and only later to its interpretation in physical terms. 

It seems desirable in introducing the subject of quantum mechanics to 
depart from the historical order and preface the mathematical develop¬ 
ment with a discussion of the physical concepts. In this chapter we first 
review briefly the experimental background and the ideas of the old 
quantum theory, then discuss the newer physical concepts of uncertainty 
and complementarity, and finally lay the groundwork for the formalism 
that will be developed in its most familiar form in Chap. II. No attempt 
will be made to deduce the structure of the formalism from the funda¬ 
mental experiments; we shall try to make the theoretical development 
seem plausible rather than unique. The justification for the theory, 
then, will rest on the agreement between deductions made from it and 
experiments, and on the simplicity (in principle more than in practice) 
and consistency of the formalism. 

1. EXPERIMENTAL BACKGROUND 

Experimental physics prior to 1900 had demonstrated the existence 
of a wide variety of phenomena, which for the most part were believed 
to be explicable in terms of what we now call classical theoretical physics. 
The motions of mechanical objects were successfully discussed in terms 
of Newton’s equations on both celestial and terrestrial scales. Appli¬ 
cation of this theory to molecular motions produced useful results in the 
kinetic theory of gases, and the discovery of the electron by J. J. Thom¬ 
son in 1897 consisted in showing that it behaved like a Newtonian particle. 

1 
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The wave nature of light had been strongly suggested by the diffraction 
experiments of Young in 1803., and was put on a firmer foundation 
by MaxwelPs discovery in 1864 of the connection between optical and 
electrical phenomena. 

Inadequacy of Classical Physics. The difficulties in the understand¬ 
ing of experimental results that remained at the beginning of this century 
were largely concerned with the development of a suitable atomic model 
and with the late discoveries of X rays and radioactivity. However, 
there were also difficulties associated with phenomena that should have 
been understood but actually were not: such things as the spectral dis¬ 
tribution of thermal radiation from a black body, the low-temperature 
specific heats of solids, and the appearance of only 5 degrees of freedom 
in the motion of a free diatomic molecule at ordinary temperatures. 

The beginning of an understanding of the second class of difficulties 
was made by Planck in 1900, when he was able to explain the black-body 
spectrum in terms of the assumed emission and absorption of electro¬ 
magnetic radiation in discrete quanta , each of which contains an amount 
of energy E that is equal to the frequency of the radiation v multiplied 
by a universal constant h (called Planck's constant): 


E — hv (1.1) 

This quantum idea was later used by Einstein in accounting for some of 
the experimental observations on the photoelectric effect. In this way 
the dual character of electromagnetic radiation became established: 
it sometimes behaves like a wave motion, and sometimes like a stream of 
corpuscular quanta. 

At about this time, the existence of discrete values for the measurable 
parameters of atomic systems (not only of electromagnetic radiation) 
became apparent through Einstein’s and Debye’s theories of the specific 
heats of solids, Ritz’s classification of spectral lines, the experiment of 
Franck and Hertz on the discrete energy losses of electrons on collision 
with atoms, and (somewhat later) the experiment of Stern and Gerlach, 
which showed that the component of the magnetic moment of an atom 
along an external magnetic field has discrete values. 

Summary of Principal Experiments and Inferences. The theoretical 
physics of the first quarter of this century thus contained two important 
inferences, obtained from the experiments and their interpretations, that 
had not existed in 1900: the dual character of electromagnetic radiation, 
and the existence of discrete values for physical quantities. The relations 
between the principal experimental conclusions and the theoretical 
inferences are shown schematically in Table 1; for a more detailed dis- 
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cussion and a bibliography, reference should be made to a book on atomic 
physics. 1 


Table 1. Relations between Experimental Interpretations and Theoretical 

Inferences 


Diffraction (Young 1803, Laue 1912). 

Black-body radiation (Planck 1900) 
Photoelectric effect (Einstein 1904) 

Compton effect (1923) 

Combination principle (Ritz-Rydberg 1908 )) 
Specific heats (Einstein 1907, Debye 1912) 
Franck-Hertz experiment (1913) 
Stern-Gerlach experiment (1922) 


{ Electromagnetic 
waves 

{ Electromagnetic 
quanta 

{ Discrete values 
for physical 
quantities 


A third theoretical inference appeared in 1924 with the suggestion by 
de Broglie that matter also has a dual (particle-like and wave-like) 
character; he assumed that the relation between the momentum p of the 
particle and the length X of the corresponding wave is 2 

x - - p (1.2) 

Up to that time all the evidence had indicated that matter was composed 
of discrete Newtonian particles; in particular, sharp tracks of charged 
particles such as electrons and helium nuclei had been observed in expan¬ 
sion cloud chambers like that invented by O. T. R. Wilson in 1911. 
Shortly after this, however, Davisson and Germer (1927) and G. P. 
Thomson (1928) independently observed the diffraction of electrons by 
crystals, and thus confirmed de Broglie’s principal supposition. 


2. THE OLD QUANTUM THEORY 

What is now called the old quantum theory 3 was initiated by the work 
of Planck on black-body radiation, and carried farther by Einstein and 
Debye. However, it was only after Rutherford’s discovery in 1911 that 
an atom consists of a small, massive, positively charged nucleus sur¬ 
rounded by electrons, that the theory could be applied to a quantitative 
description of atoms. 

1 See, for example, E. K. Ric.litinyer and E. H. Kennard, “Introduction to Modern 
Physics” (McGraw-Hill, New York, 1942); M. Born, “Atomic Physics” (Blackie, 
Glasgow, 1940). 

2 Equation (1.2) is also valid for light quanta, as may be seen by dividing both 
sides of Eq. (1.1) by the velocity of light r; for a directed beam of light p = E/c and 
X = c/v. 

3 For a more detailed discussion than is presented in this section, see the books cited 
above, and L. Pauling and E. B. Wilson, Jr., “Introduction to Quantum Mechanics,” 
Chap. II (McGraw-Hill, New York, 1935). 
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Bohr-Sommerfeld Quantization Rules. The first step in this direc¬ 
tion was taken by Bohr in 1913, when he made two postulates concerning 
the electronic or extranuclear structure of an atom. The first of these 
was that an atomic system can exist in particular stationary or quantized 
states, each of which corresponds to a definite energy of the system. 
Transitions from one stationary state to another are accompanied by 
the gain or loss, as the case may be, of an amount of energy equal to the 
energy difference between the two states; the energy gained or lost 
appears as a quantum of electromagnetic radiation, or as internal or 
kinetic energy of another system. The second postulate (in agreement 
with that of Planck and Einstein) was that a radiation quantum has a 
frequency equal to its energy divided by Planck's constant h. 

These two postulates by themselves provided some insight into the 
Ritz combination principle and the Franck-Hertz experiment. To obtain 
specific results for hydrogen, Bohr proposed a simple rule for the selection 
of those circular orbits which are to constitute stationary states: the 
angular momentum must be an integral multiple of h/2r. A more 
general quantization rule was discovered independently by W. Wilson 
(1915) and by Sommerfeld (1916), thus making possible the application 
of Bohr's postulates to a wider variety of atomic systems. This rule is 
applicable to Hamiltonian systems in which the coordinates are cyclic 
variables, and states that the integral of each canonical momentum with 
respect to its coordinate over a cycle of its motion must be an integral 
multiple of h. The rule was applied with considerable success to the 
computation of the fine structure of hydrogen, the spectra of diatomic 
molecules, and other problems. 

Practical Difficulties. The old quantum theory encountered practical 
difficulties in several different respects. It could not be applied to 
aperiodic systems, it provided only a qualitative and incomplete treat¬ 
ment of-the intensities of spectral lines, and it did not give a satisfactory 
account of the dispersion of light. Moreover, improvements in experi¬ 
mental techniques soon showed that there were problems, such as the 
rotational spectra of some diatomic molecules, to which the theory gave 
unambiguous but incorrect answers. 

The correspondence principle was introduced by Bohr in 1923 in an 
effort to make use of the classical theory as a limiting case to infer some 
properties of atomic systems, especially the intensities of spectral lines. 
Although much was achieved in this way, it was clear in the early 1920's 
that the quantum theory as it then existed was unsatisfactory. 

Conceptual Difficulties. Quite apart from the practical difficulties 
outlined above, the old quantum theory failed to give a conceptually 
satisfactory account of the fundamental phenomena. It was difficult to 


Sec. 2] THE PHYSICAL BASIS OF QUANTUM MECHANICS 


5 


understand why the electrostatic interaction between a hydrogen nucleus 
and an electron should be effective when the ability of the accelerated 
electron to emit electromagnetic radiation disappeared in a stationary 
state. The mechanism of emission and absorption of radiation in transi¬ 
tions between stationary states was obscure. The quantization rules 
were arbitrary even when they were most effective. And the assumption 
of a dual character for light (particle-like on emission and absorption 
and wave-like in transit) seemed to be self-contradictory. 

In order to illustrate the conceptual difficulties and the way in which 
they are dealt with by the new quantum mechanics, we consider in some 
detail a simple diffraction experiment, which is illustrated schematically 
in Fig. 1. A light source S illuminates a diaphragm A in which two slits 


<§> 
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( 

( 

< 

<! 
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Fjcj. 1 . A diffraction experiment in which light, from S passes through t-ho two slits in A to 
form a diffraction pattern at B. 


are cut. A diffraction pattern appears at a photosensitive screen B 1 
and the ejected photoelectrons are most numerous at the diffraction 
peaks. Here we have the radiation behaving as a wave during its passage 
from source through slits to screen, but behaving as a stream of light 
quanta or photons when it ejects electrons from B. We now know that a 
similar experiment could be set up with matter instead of radiation. 
The diffraction pattern of electrons scattered from a crystal (analogous 
to the slits in A) may be detected as a distribution of electron tracks in a 
Wilson cloud chamber (analogous to the screen B), so that the wave and 
particle aspects of matter appear in the same experiment. 

In the situation illustrated in Fig. 1, we might at first suppose that 
the diffraction pattern is due to an interference between different photons 
passing through the two slits, thus explaining the observations entirely 
in terms of the particle picture. That this is not a sufficient explanation 
may be shown by decreasing the intensity of the light until an average of 
only one photon at a time is in transit between source and screen. The 
diffraction pattern still appears as the distribution of the large number of 
photons accumulated over a sufficiently long time. Thus we must con- 
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elude that diffraction is a statistical property of a single photon, and does 
not involve an interaction between photons. From the point of view of 
the particle picture, we may then ask how it is that a stream of inde¬ 
pendent photons, each of which presumably can go through only one of 
the slits, can produce a diffraction pattern that appeal’s only when both 
slits are open. Or to put the question in another way, how can the 
presence of a slit through which a photon does not go prevent that photon 
from reaching a part of the screen it would be likely to reach if that slit 
were closed? 

Quantum-mechanical Viewpoint. In this question is implicit the 
assumption that the photon actually does go through a particular one 
of the two slits. This assumption is natural from the point of view of the 
classical theory or the old quantum theory since these theories regard a 
photon or other particle as having a definite and determinable position 
at each instant of time. The quantum mechanics, however, discards this 
assumption, and asserts instead that the position of a photon has meaning 
only when the experiment includes a position determination. Moreover, 
this part of the experiment will affect the remainder of the experiment 
and cannot be considered separately. Thus from the point of view of 
quantum mechanics, the question asked in the last paragraph is without 
meaning, since it assumes that the photon goes through a particular one 
of the two slits (thus making it possible to close the other slit) when there 
is no provision in the experiment for determining through which slit the 
photon actually goes. 

The quantum mechanics resolves the situation by telling us that the 
diffraction pattern is destroyed if a sufficiently careful attempt is made 
to determine through which slit each photon passes (see See. 4). We 
must then be prepared to forego the customary mental picture of a 
photon (or an electron) as a classical particle that has at each instant of 
time a position that can be determined without damage to diffraction 
patterns of the type discussed here. Thus classical causality, which 
requires that the motion of a particle at any time be uniquely determin¬ 
able from its motion at an earlier time, must also be abandoned. The 
new theory that is forced upon us in this way is so successful in other 
respects as well that, at the present state of knowledge, we must regard 
such classically incomplete descriptions as a fundamental properly of 
nature. 


3. UNCERTAINTY AND COMPLEMENTARITY 

Before presenting a more quantitative discussion of the diffraction 
experiment outlined in Sec. 2, we consider two principles that express 
in qualitative terms the physical content of the theory of quantum 
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mechanics. We restrict ourselves here to a discussion of their meaning, 
and give arguments for their validity in Sec. 4. 

Uncertainty Principle. The first of these is the uncertainty principle, 
developed by Heisenberg 1 in 1927. According to this principle, it is 
impossible to specify precisely and simultaneously the values of both 
members of particular pairs of physical variables that describe the behav¬ 
ior of an atomic system. The members of these pairs of variables are 
canonically conjugate to each other in the Hamiltonian sense: a rectangu¬ 
lar coordinate x of a particle and the corresponding component of momen¬ 
tum p X} a component J z of angular momentum of a particle and its 
angular position <jj in the perpendicular (xy) plane, the energy E of a par¬ 
ticle and the time t at which it is measured, etc. Put more quantitatively, 
the uncertainty principle states that the order of magnitude of the 
product of the uncertainties in the knowledge of the two variables must 
be at least Planck’s constant h divided by 2t (h = h/2r = 1.054 X 10" 27 
erg-second), 2 so that 


Ax • A p x > A 

(3.1) 

A <$> • A J 2 > h 

(3.2) 

At - AE > A 

(3.3) 


The relation (3.1) means that a component of the momentum of a 
particle cannot be precisely specified without our loss of all knowledge 
of the corresponding component of its position at that time, that a particle 
cannot be precisely localized in a particular direction without our loss of 
all knowledge of its momentum component in that direction, and that in 
intermediate cases the product of the uncertainties of the simultaneously 
measurable values of corresponding position and momentum components 
is at least of the order of magnitude of A. Similarly, Eq. (3.2) means, 
for example, that the precise measurement of the angular position of a 
particle in an orbit carries with it the loss at that time of all knowledge 
of the component of angular momentum perpendicular to the plane of 
the orbit. Equation (3.3) means that an energy determination that 
has an accuracy A E must occupy at least a time interval At ~ h/AE] 
thus if a system maintains a particular state of motion not longer than 
a time A/, the energy of the system in that state is uncertain by at least 
the amount AE ~ A/A/., since At is the longest time interval available for 
the energy determination. The smallness of //. makes the uncertainty 
principle of interest primarily in connection with systems of atomic size. 

1 VV. TTcisonberg, Zeilx. f. Phfjsik , 43, 172 (1027). 

2 It. T. Tlirgo, Itev. Mod. Plu/». 13, 233 (10-11). 
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As we shall see in Sec. 12, the uncertainty principle may be obtained 
directly from the mathematical formulation of the theory, and this is 
actually the way in which it was first obtained by Heisenberg. 

Complementarity Principle. In order to understand the implications 
of the uncertainty principle in more physical terms, Bohr 1 introduced the 
complementarity principle in 1928. This principle states that atomic 
phenomena cannot be described with the completeness demanded by 
classical dynamics; some of the elements that complement each other to 
make up a complete classical description are actually mutually exclusive, 
and these complementary elements are all necessary for the description of 
various aspects of the phenomena. From the point of view of the experi¬ 
menter, the complementarity principle asserts that the physical apparatus 
available to him has such properties that more precise measurements 
than those indicated by the uncertainty principle cannot be made. 

This is not to be regarded as a deficiency of the experimenter or of his 
techniques. It is rather a law of nature that, whenever an attempt is 
made to measure precisely one of the pair of canonical variables, the other 
is changed by an amount that cannot be too closely calculated without 
interfering with the primary attempt. This is fundamentally different 
from the classical situation, in which a measurement also disturbs the 
system that is under observation, but the amount of the disturbance can 
be calculated and taken into account. Thus the complementarity 
principle typifies the fundamental limitations on the classical concept 
that the behavior of atomic systems can be described independently 
of the means by which they are observed. 

Limitations on Experiment. In the atomic field, we must choose 
between various experimental arrangements, each designed to measure 
the two members of a pair of canonical variables with different degrees 
of precision that are compatible with the uncertainty relations. In 
particular, there are two extreme arrangements, each of which measures 
one member of the pair with great precision. According to classical 
theory, these extreme experimental arrangements complement each other; 
the results of both may be obtained at once and are necessary to supply 
a complete classical description of the system. In quantum mechanics, 
however, the extreme complementary experiments are mutually exclusive 
and cannot be performed together. 

It is in this sense that the classical concept of causality disappears in 
the atomic field. There is causality in so far as the quantum laws that 
describe the behavior of atoms are perfectly definite; there is not, how¬ 
ever, a causal relationship between successive configurations of an 

1 N. Bohr, Nature , 121, 580 (1928); a Atomic Theory and the Description of 
Nature/ 7 especially Part II (Cambridge, London, 1934) \Phys. Rev., 48, 696 (1935). 
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atomic system when we attempt to describe these configurations in 
classical terms. 


4. DISCUSSION OF MEASUREMENT 


In this section we consider two fairly typical measurement experi¬ 
ments from the point of view of the new quantum mechanics. The first 
is the diffraction experiment that was brought up in Sec. 2, and the 
second is an experiment designed to determine the position and momen¬ 
tum of a particle by an optical method. 

Analysis of Diffraction Experiment. We first analyze the diffraction 
experiment of Sec. 2 from the point of view of the complementarity 
principle, assuming that the uncertainty principle is valid. Two con¬ 
trasting arrangements, which would complement each other classically, 



Fig. 2. The experimental arrangement of Kig. 1, modified by the addition of photon 
indioatorH C. 


arc considered here. One of these is illustrated in Fig. 1. Since it is 
assumed that the distance from A to B is large compared to the distance 
between the two slits, and this in turn is large compared to the wave 
length of the light, the distribution of intensity in the diffraction pattern 
determines to good approximation the angular distribution of the photons 
leaving the slits in A, and hence determines the distribution of the 
y components of momentum of the photons beyond A. The second 
arrangement, shown in Fig. 2, determines through which of the two slits 
each photon passes, and hence provides information on the y coordinates 
of the photons. 

In the second arrangement each photon registers itself as it passes 
through a slit by bouncing off one of a number of indicators C placed 
close to A , and giving up to it a y component of momentum that may be 
uncertain by the amount A p v . If we do not want the resultant diffraction 
pattern of many such photons to be destroyed by these events, the 
uncertainty in p y for a particular photon produced by its encounter with 
an indicator must be substantially smaller than would be required to 
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throw the photon from a maximum of the diffraction pattern at B into a 
neighboring minimum. With a photon of momentum this requires 
that 

A p v «6p w (4.1) 


For the simple case in which R^> a ^>\ the angle 0 is known experi¬ 
mentally (or from the theory of wave optics) to be given by 



(4.2) 


in terms of the optical wave length X and the distance a between the slits. 
At the same time, we have not learned through which slit this photon 
passed unless the uncertainty Ay of the y position of the indicator that 
recoiled is less than half the distance between slits. 

Ay < ia (4.3) 

We now use some of the experiments discussed in section 1 [see Eq. 
(1.2)] to establish a relationship between wave length and momentum 
that holds equally well for photons and material particles, 

X = A (4.4) 

It is then apparent from Eqs. (4.1) through (4.4) that the requirement 

that we be able to determine through which slit each photon passes 
without destroying the diffraction pattern at B is equivalent to the 
requirement that 

Ay • Ap v « \h ( 4 . 5 ) 

for each indicator that is used. Since Eq. (4.5) is in disagreement with 
the uncertainty relation (3.1), we may conclude that it is impossible 
to determine through which slits the photons pass without destroying the 
diffraction pattern. 

Discussion of Diffraction Experiment. The situation just analyzed 
shows the intimate connection between the theoretical principles of 
uncertainty and complementarity and the experimental observations 
related to localization and diffraction. It provides an explicit demon¬ 
stration of the validity of the complementarity principle (represented in 
this case by the choice between the mutually exclusive but classically 
complementary experiments for observing the diffraction and for localiz¬ 
ing the photon) when taken in conjunction with the experimentally 
observable properties of matter and radiation. For it shows that no 
fundamental difficulty need be encountered with the photon picture so 
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long as we do not insist on the degree of detail in describing the situation 
that is entailed by classical concepts. 

It is, of course, still necessary to ascribe unfamiliar properties to 
the photons in order to explain the experimental observations; the fore¬ 
going discussion does not show how individual photons can interfere with 
themselves to produce the diffraction pattern. 1 Nor on the other hand 
does it show how an electromagnetic wave can eject photoelectrons from 
the screen. 2 Such demonstrations lie beyond the scope of the qualitative 
discussion of this chapter, and require the use of the mathematical 
formalism of quantum mechanics. However, analysis of the diffraction 
experiment from the point of view of quantum mechanics removes the 
difficulty encountered in Sec. 2: the diffrac¬ 
tion pattern disappears whenever a success¬ 
ful attempt is made to determine through 
which slit each photon passes. 

Localization Experiment. We now con¬ 
sider a particular example of the validity of 
the uncertainty principle, making use of 
a position-momentum determination that is 
typical of a number of somewhat similar ex¬ 
periments that have been discussed in con¬ 
nection with measurements on particles and 
radiation fields. 3 Wo shall consider here 
the accuracy with which the x components 
of the position and momentum vectors of a 
material particle can be determined at the 
same time by observing the particle through 
a rather idealized microscope by means of scattered light. 

The best resolving power of the lens L shown in Fig. 3 is known (again 
either experimentally or from the theory of wave optics) to provide an 
accuracy 

A.r ~ (4*0) 

Sill 6 

in a position determination, where X is the wave length of the radiation 
that enters the lens, and € is the half angle subtended at the particle P 

1 Chapter VI shows the equivalence of the wave theory of Chap. II and the general 
quantum-mechanical theory of particles, so far as matter is concerned. 

2 Chapter XIV shows how the theory of the electromagnetic Field can ho modified 
to include quantum effects. 

3 See, for example, W. Heisenberg, “The Physical Principles of the Quantum 
Theory," Chaps. If, III (University of Chicago Press, Chicago, 1030). 



Fia. 3. An experiment for the 
localization of a particle P by 
moil ns of one of tho scattered 
quanta Q, which is focused by the 
Ions L to form an imago on the 
screen S. 
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by the lens. For simplicity, we consider the ease m which nnlv < mo 
the light quanta Q is scattered onto the screen ^ * (( . aU ‘* \ l< 

aperture of the lens, the precise direction in whic 1 »i< P * (> 011 s< a 01 

into the lens is not known. Then since Eq. (4.4) states ■ wi. M 11l< 
turn of the photon after it is scattered is h/\, the uncoif.nu > in the 
component of its momentum is approximately (A-/X) sin c. 

The # components of the momenta of the photon tuid I n i purti(»l(> 
can be accurately known before the scattering takes place, stnc(* ( horo i:s 
no need then to know the# components of their positions. A 1 m>, ii 01 n* 
position measurement refers to the displacement of the. parlirU* wi1,lx 
respect to the microscope, there is no reason why the total moment urn of 
the sj'stem (particle, photon, and microscope) need be altered during t-Iio 
scattering. Then the uncertainty Ap x in the x componenl of the iiinmtui^ 
turn of the particle after the scattering is equal to the rorrespcm< liri^ 
uncertainty for the photon. 


A p, 


h . 

^ - sm €. 
A 


i 1.7) 


If we combine Eq. (4.6) with Eq. (4.7), we see that just after the sen ( toe¬ 
ing process, 

Az-Ap x ~h (l.cS) 

is the best that we can do for the particle. Thus a realistic aceuun t iii££ 
of the properties of the radiation gives a result in agreement with (.lio 
uncertainty relation (3.1) for the particle. 

This experiment may also be considered from the point of viow of 
the complementarity principle. The complementary arrungemen t sell fToir 
in the choice of w r ave length of the observed radiation: sufficiently small 
A permits an accurate determination of the position of the particle jtmfc 
after the scattering process, and large A of its momentum. 


5. WAVE PACKETS IN SPACE AND TIME 

The relation (4.4) between momentum and wave leniri!i which is 
known experimentally to be valid for both photons and purlieles *„«-! 

dScritM 14 1 ™?* be P° S8lble t0 use concentrated bunches „f wnves U> 
pa . rtlcles of matter and quanta of radial ion. To fix our 
ideas, we shall consider a wave amplitude or wave function 1 1ml .Irpon.l-s ( >n 
the space coordinates x,y,z and the time t This quantity* , V , 
have three basic properties. First, it can interfere with its,.If s„ . ii 
can account for the results of diffraction experime * , . ‘ 

large m magnitude where the particle or nhn+on • , n *?' n ' > 11 1H 

elsewhere And third will h . P 18 likely to ho nnd snijill 
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such quanta. This last is an essential requirement in view of the conclu¬ 
sion of Sec. 2 that a single quantum of matter or radiation interferes 
with itself rather than with other quanta. In this section we shall con¬ 
fine ourselves to a qualitative discussion of the one-dimensional case, in 
which the wave function depends only on x and t, and leave the quan¬ 
titative development for Chap. II. 

Space Packets, A typical form for a concentrated bunch of waves, 
which we shall call a wave packed is shown in Fig. 4a, where is 

plotted against x for a particular time L The average wave length 



Ao and the approximate extension Ax of the packet are indicated in the 
diagram. The Fourier integral analysis 1 of \J/ with respect to x is now of 
interest since it shows how \p may be built up out of continuous harmonic 
waves of various lengths. This is indicated in Fig. 4b, in which the 
Fourier transform of $ is plotted schematically against the propagation 
number k = 

It can be shown by standard mathematical methods that 

" * E (5.1) 

where A k is the approximate spread in propagation number associated 
with the packet. If now wc correlate wave length and momentum as in 
Eq. (4.4), we see that the spread A k corresponds to a momentum spread 

Ap = A = Jj-Ak = h - Ak (5.2) 

Combination of Eq. (5.1) with Eq. (5.2) gives 

Ax ' Ap > h (5.3) 

1 See, for example, L. A. Pipes, “Applied Mathematics for Engineers and Physi¬ 
cists,” Chap. ITT (McGraw-Hill, New York, 1940). 
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which agrees with the uncertainty relation (3.1). Thus the uncertainty 
principle for position and momentum of a quantum of matter or radiation 
follows directly from the wave-packet description and Eq. (4.4). 

Time Packets. In analogous fashion, we may examine the depend¬ 
ence of ^ on the time t for a point x that is typical of the packet, and 
obtain a time Fourier transform that shows how \p can be built up out of 
continuous harmonic waves of various frequencies v . In this case the 
relation between the spread in time of t p and the spread in frequency of 
the time Fourier transform of \f/ is 

At • i- (5.4) 

Equation (5.4) may be brought into correspondence with the uncer¬ 
tainty principle by associating the energy E of a quantum with the fre¬ 
quency of the wave that represents it in a manner similar to the 
association of momentum with wave length given by Eq. (4.4). We 
shall make this connection through Eq. (1.1): 

E = hv (5.5) 

which may be inferred in the case of photons from the experimental 
discussion of Sec. 1. Combination of Eq. (5.4) with Eq. (5.5) then gives 
the uncertainty relation (3.3). 

The assumption that Eq. (5.5) is valid for matter as well as for radia¬ 
tion may be made plausible by computing the group velocity 1 of a wave 
packet that represents a nonrelativistic particle of mass m, kinetic energy 
E } and momentum p for which X and v are given by Eqs. (4.4) and (5.5), 
respectively. The group velocity, which is the velocity of the center of 
the packet, is equal to 

dv _ dE __ d(p 2 /2m) __ p 
d{ 1/X) ~ dp m ( ■ 0 

in agreement with the classical expression for the velocity. This shows 
that with Eq. (5.5), the wave-packet description of the motion of a 
particle agrees with the classical description when the circumstances arc 
such that we can ignore the size and internal structure of the packet. 

Wave Formalism. We see then that quanta of matter or radiation 
can be represented in agreement with the uncertainty principle by wave 
packets that may be superposed to produce interference and whose 
magnitudes give a measure of likelihood of location, provided that use is 
made of the experimentally inferred relations (4.4) and (5.5). It is then 
possible to set up a quantitative formalism based on the mathematical 

1 M. Bom, op. at., pp. 85, 295. 
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analysis of wave motion. This will be done for matter in Chap. II, using 
the physical principles outlined in this chapter as a guide, and requiring 
always that the result of any calculation reduce to the result of the cor¬ 
responding classical calculation in the appropriate limit. This require¬ 
ment is a way of stating Bohr’s correspondence principle, which was 
mentioned in Sec. 2. At the present time, when a reasonably complete 
quantum theory exists, the correspondence principle is mainly of interest 
in assuring that the formalism has the proper classical limit, although it 
may also be of heuristic value in performing new calculations or extending 
the boundaries of the theoiy. 

It might at first be thought that the exclusive use of a wave formalism 
for the description of matter in the next four chapters would conflict 
with the observed particle-wave duality discussed in Sec. 1 and hence 
disagree 'with the complementarity principle. This is not the case, how¬ 
ever, for the formalism actually provides an understanding of all of the 
measurable properties of matter, including, for example, the production of 
particle tracks in a cloud chamber. Thus it will be shown in Sec. 30 
that, if a single material particle is represented by a wave function of 
definite momentum and hence completely undetermined position, the 
probability of ionization of two or more molecules of the cloud-chamber 
gas is negligibly small unless the molecules lie very nearly along a line 
parallel to the momentum vector. 

It must be emphasized that these remarks are true only if a wave 
function of the typo described in this section is always interpreted as 
representing just one particle of matter, and not the statistical distribu¬ 
tion of a number of particles. When we wish to describe more than one 
particle, we must make use of a wave function that depends on the coordi¬ 
nates of all of them. The analogous analytical description of light quanta, 
which will be taken up quantitatively in Chap. XIV, makes use of a 
somewhat different, approach. This is mainly because photons (unlike 
particles as we consider them here) can be emitted or absorbed through 
interaction with matter, so that their number is not fixed. Thus a photon 
wave function would have to depend on a variable number of parameters, 
and it is desirable to avoid such a situation. 

Problems 

1. (Jive a brief description of each of the experiments referred to in Table 1, 
together with their interpretations. 

2. Describe briefly the Davisson-Gcrmer and Thomson experiments, and the use 
of the Wilson cloud chamber for the observation of particle tracks. 

3. A beam of silver atoms for a Stern-Gerlach experiment is produced by eollimat¬ 
ing atoms that vaporize from silver held in a furnace at 1200°G. If the beam travels 
1 meter, use the uncertainty relation to find the order of magnitude of the smallest 
spot that can be obtained at the detector. 
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4 . Show that if a component of angular momentum of the electron of a hydrogen 
atom is known to be 2ft within 5 per cent error, its angular orbital position in the plane 
perpendicular to that component cannot be specified at all. 

5. Uae Eq. (3.2) to show qualitatively that if one component of angular momen¬ 
tum of a particle is precisely known, the two perpendicular angular-momentum com¬ 
ponents cannot be known accurately at the same time. 

6. A 1-ounce rifle bullet takes 0.5 second to reach its target. Regarding the bullet 
as a mass point, and neglecting effects of air resistance and earth motion, find the 
order of magnitude of the spread of successive shots at the target under optimum 
conditions of aiming and firing. 

7. A perfectly elastic ping-pong ball is dropped in vacuum from a height equal to 
ten times its radius onto a perfectly elastic fixed sphere of the same radiuB. Neglecting 
effects due to earth motion, estimate the l&rgeBt number of bounces against the fixed 
sphere that the ball can be expected to make under optimum conditions of release. 

8. A beam of monoenergetic electrons is UBed to excite a particular level of an 
atom in a Eranck-Hertz experiment. If this level is of short duration, owing to radia¬ 
tion back to the ground state, show that the inelastically scattered electrons that 
have lost energy to produce the excited level will not all be expected to have the same 
final energy. If the excited level lasts about 10“ la second, what is the order of magni¬ 
tude of the electron energy spread, measured in electron-volts? 

9. Discuss any connections you can think of that exist between the three uncer¬ 
tainty relations (3.1), (3.2), and (3.3). 
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THE SCHRODINGER WAVE EQUATION 

This and the next several chapters are concerned with the non- 
relativistic motion of a particle in a force field that can be represented by 
a potential energy. A quantitative description of the motion in terms of 
a differential equation, the Schrodinger wave equation , is developed in this 
chapter, and applied to a simple one-dimensional problem. Various 
assumptions have to be made as regards the structure of the wave equa¬ 
tion, the boundary and continuity conditions on its solutions, and the 
physical meaning of these solutions. These assumptions are given a high 
degree of plausibility in this chapter and the next by relating them to 
experimental results, mainly those that deal with the diffraction of 
material particles and with the existence of a classical limit to the quan¬ 
tum mechanics. However, no attempt is made to derive the formalism 
uniquely from a consideration of the experiments. The definitive test 
of the theory must, of course, be its internal consistency and the success 
with which deductions from it agree with particular experimental meas¬ 
urements; some examples will be worked out in Chaps. IV and V. 

6. DEVELOPMENT OF THE WAVE EQUATION 

The form of the Schrodinger wave equation is obtained in this section 
by generalizing the properties of the wave amplitude of See. 5. The 
remainder of this chapter presents a discussion of some of the properties 
of the equation and its solutions. 

Traveling Harmonic Waves. The first task is to develop in more 
quantitative fashion the properties of the one-dimensional wave function 
that was discussed qualitatively in Sec. 5. It was shown there that 
for a continuous traveling harmonic wave, the wave length anti momen¬ 
tum arc related by Eq. (4.4) and the energy and frequency by Eq. (5.6). 
We rewrite these two equations in terms of the universal constant 
h = h/2ir: 

v = hk, k = y (6.1) 

E = ha), a) = 2ttv (6.2) 

A wave function yp(xd) that represents a particle of completely unde¬ 
termined position traveling in the positive x direction with precisely 
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known momentum p and kinetic energy E, would then be expected 1 < > 
have one of the forms 


cos (kx - «0 f ^ (kx - < < f > 

or some linear combination of them. This follows from diffrac tion experi¬ 
ments like those of Davisson and Germer and of Thomson (see Set 1 !. \ 
and from the requirement that a wave packet of approximately tli<> 
propagation number k and angular frequency co has ft group velocity oc juul 
to that of a classical free particle of momentum p anti energy K |sce lOq. 
(5.6)]. 

Need for a Wave Equation. In order to go beyond the very simplest 
problem of a continuous harmonic wave, it is very desirable to have* ;ui 
equation of which both the harmonic waves and more complicated waives 
are solutions. An example from a more familiar field of physics should 
help to clarify this point. In the case of three-dimensional sound waives 
in a gas, it is possible to obtain a solution of the problem of the scattering; 
of sound by a rigid sphere by superposing plane harmonic waves t raveling 
in various directions. But it is far simpler to solve the dilToroiit.iiil 
equation for sound waves directly in spherical polar coordinate's. if tlx. 
temperature of the gas changes from point to point, no progress cun lu> 
made in the general case without such a differential equation. The cor¬ 
rect underlying equation for sound waves can bo found from direct con¬ 
sideration of the mechanical properties of the gas. While I his is mil 
the case with the equation of which the wave functions of Sec. 5 arc s< >Iu- 


tions, it is no less imperative to find the form of the equation. The nocul 
for this becomes more evident when the wave function is to describe < In* 
motion of a particle under the influence of external forces; this sifuul ion 
turns out to be analogous to the propagation of sound waves in nn 
inhomogeneous gas. We shall, therefore, find an equation for \p, and 
having found it, shall regard it as a more fundamental attribute of 11 ■<> 


wave functions than the harmonic forms (0.3). 

The equation must have two basic properties. First, it must 1><> 
linear, in order that solutions of it can be superposed to produce in(<*r- 
ference effects (in the three-dimensional case) and to permit flic con¬ 
struction of wave packets. And second, the coefficients of 1 1„. ,.,,unt ion 
must involve only constants such as h and the mass and charge of < he 
particle, and not the parameters of a particular kind of motion of ( hr 
particle (momentum, energy, propagation number, and frequenev). 
The reason for the latter requirement is that we shall want to leave „pen 
the possibility of superposing solutions that belong to different values of 

«™„, P r me T; this , me “ s ** »“!> • more general wave luneti,,.. 
cannot be a solution of an equation that involves the parameters in its 
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structure. Since differential equations are the easiest to handle, it is 
worth while to try this type first, and it turns out that the requirements 
can be met by a differential equation. 

With all these considerations in mind, we look first at the most familiar 
one-dimensional wave equation, that which describes the motion of 
transverse waves on a string or plane sound waves in a gas: 


dt 2 “ 7 dx 1 


(6.4) 


where y is the square of the wave velocity. Substitution of the forms 
(6.3) into Eq. (6.4) shows that each of the four harmonic solutions, and 
hence any linear combination of them, satisfies this differential equation, 
if and only if we put 

7 k 2 V 2 4m 2 ^ 6 * 6 ' 

where m is the mass of the particle that is to be described by Eq. (6.4). 
Because of the structure of Eq. (6.5) it is apparent that the coefficient y 
that appears in Eq. (0.4) involves the parameters of the motion (E or p); 
we therefore discard this differential equation. 

The One-dimensional Wave Equation. In looking further for a suit¬ 
able equation, it is helpful to note that differentiation with respect to x 
of wave functions like those of (6.3) has the general effect of multiplica¬ 
tion of the function by k (and sometimes also interchanging sine and 
cosine), while differentiation with respect to t has the general effect of 
multiplication by gj, Then the relation E = p 2 /2m, which is equivalent 
to the relation cu = hk 2 /2m i suggests that the differential equation for 
which we are looking contains a first derivative with respect to t and a 
second derivative with respect to x. 


M d'hp 

in =y w 


( 6 . 6 ) 


Substitution shows that the first two of the wave functions (6.3) are not 
solutions of Eq. (6.6), but that either of the last two may be (but not both 
at once) if the constant y is suitably chosen. In particular, if we choose 

io) ihE ih ^ 

7 - T? - 1? - 2m (0 ' 7) 

then the third of the wave functions (0.3) satisfies Eq. (6.6). Moreover, 
the value of y given by Eq. (6.7) involves only the constants h and m. 

We are thus led to the one-dimensional form of the Schrodinger 1 wave 

1 E. Schrodinger, Ann. d. Phy&ik , 79, 301, 480 (1026); 81, 100 (1026). The present 
treatment is somewhat different from that originally given by Schrodinger. 
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equation for a free particle of mass m 3 which from Eqs. (6.6) and (6.7) 
may be written 


75 W* _ 

%n dt ~ ” 2 mdtf 


( 6 . 8 ) 


The particular form in which Eq. (6.8) is written is significant in so far 
as its harmonic solution, the third of the wave functions (6.3), makes 
the left side E$ and the right side (p 2 /2m)^. The fact that the solution 
gf(/sa>—j 0 complex is not in itself a defect of the formalism. We shall 
have to be certain that all predicted results of possible physical observa¬ 
tions are expressible in terms of real numbers, and this will supply a 
condition on the detailed interpretation of 

Extension to Three Dimensions. The foregoing one-dimensional 
treatment is readily extended to three dimensions. It is natural to 
rewrite Eq. (6.1) as 

p = ftk, A = |k| = y (6.9) 


where k is called the propagation vector . Similarly, the third of the wave 
functions (6.3) becomes 

exp [i(k • r — cot)] (6.10) 

where r is the position vector for the particle. Then by an obvious 
extension of the argument that led up to Eq. (6.8), it is seen that the 
three-dimensional Schrodinger equation for a free particle that is repre¬ 
sented by the wave function r y t ) is 


ft 2 9 

ih — = — V 2 ^. 

dt 2m 


( 6 . 11 ) 


A comparison of Eqs. (6.9), (6.10), (6.11), and the classical energy 
equation 



( 6 . 12 ) 


suggests that, at least for a free particle, the energy and momentum can 
be represented by differential operators that act on the wave function \p. 

a 

E —* ih —j p —> — ih grad (6.13) 

The development of Secs. 7, 8, 10, and 11 will show that these are also 
valid representations when the particle is not free. 

Inclusion of Forces. The next problem is to extend the free-particle 
wave equation (6.11) so that it includes the effects of external forces that 
may act on the particle. We shall assume for the present that these 
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forces are of such a nature (electrostatic, gravitational, possibly nuclear) 
that they can be combined into a single force F that is derivable from a 
potential energy V. 

F(r t t) = - grad 7(r,Z) (6.14) 

Just as the classical relation between energy and momentum is used above 
to infer the structure of Eq. (6.11), so it is desirable now to start from the 
corresponding classical relation that includes external forces. This is 
simply expressed in terms of the potential energy 

E = £ + V(t,t) (6.15) 

where E is now the total energy, and the first and second terms on the 
right side of Eq. (6.15) are the kinetic and potential energies of the 
particle, respectively. 

Since V does not depend on p or E, Eqs. (6.15) and (6.13) suggest that 
Eq. (6.11) be generalized into 

-i^VV+W ( 6 . 16 ) 

This is the Schrodinger wave equation that describes the motion of a 
particle of mass m in a force field given by Eq. (6.14). While the intro¬ 
duction of Eq. (6.16) cannot claim as high a degree of plausibility as the 
derivation of the frce-particle equation (6.11), the further discussion 
of the next section should make it more convincing. It is, of course, the 
agreement of solutions of Eq. (6.16) with experiment in particular cases 
that eventually demonstrates the validity and usefulness of this wave 
equation. 

7. INTERPRETATION OF THE WAVE FUNCTION 

The wave function Tp(t y t), which is a solution of the wave equation 
(6.16), is now assumed to provide a quantum-mechanically complete 
description of the behavior of a particle of mass m with the potential 
energy 7(r,/), and hence is analogous to the classical trajectory r(£). 
Thus far, the only interpretative guide available to us is that the wave 
function be large where the particle is likely to be and small elsewhere. 
This has to be supplemented with more detailed statements that enable 
us to get out of i p the maximum amount of information permitted by 
nature, as was discussed in Sec. 3. As with the structure of the wave 
equation, the correctness of our interpretation of the wave function must 
in the end be judged by logical consistency and appeal to experimental 
results. 
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Statistical Interpretation- The phrase “likely to be” in the preceding 
paragraph, together with the discussion of Sec. 3, indicates the need for 
interpreting yp in statistical terms. We can imagine a very large number 
of identical, independent, nonoverlapping regions of space, each large 
enough to contain all the physically interesting features of the motion, 
in each of which the behavior of a particle with the potential energy 
F(r,£) is described by the same wave function yp{t y t)\ in each case r is 
referred to the origin of the particular region. We then make the assump¬ 
tion that the numerical result of the measurement at a particular time t 
(in so far as the time at which the measurement is made can be specified) 
of any physically meaningful quantity, such as position, momentum, 
or energy, will in general not be the same for all the regions. Rather, 
there will be a distribution of these numbers that can be described by a 
probability function. 

For example, we have seen in Sec. 5 that the result of a position deter¬ 
mination is to be regarded as uncertain by an amount of the order of the 
linear dimensions of the Avave function. It is natural therefore to regard 
f as a measure of the probability of finding a particle at a particular 
position Avith respect to the origin of its region. HoAvever, a probability 
must be real and nonnegative, whereas yp is complex in general. Wo 
therefore assume that the product of yp and its complex conjugate ip is 
the 'position probability density . 

Pi r,0 = |*(r,fl|* (7.1) 

This means that P(r y t)dxdydz is to be the probability of finding a particle 
in its volume element dxdydz about its point r at the time t , Avhcn a large 
number of precise position measurements are made on independent 
particles each of Avhich is described by the one-particle Avave function 

’KM)- 

Normalization of \p. The probability of finding the particle some¬ 
where in the region must be unity, so that Eq. (7.1) implies that the Avave 
function is normalized: 

J>(r,0| 2 dr = 1 (7.2) 

where the integral extends over the entire region; here dr is the volume 
element dxdydz. If yp is a wave packet of the type discussed in Sec. 5, 
the integral in Eq. (7.2) converges, and the numerical coefficient of \ p may 
be adjusted so that the integral is unity; such normalization does not of 
course change the fact that yp is a solution of Eq. (0.16), which is homo¬ 
geneous in yp. There are, however, Avave functions like that given in 
Eq. (6.10) for Avhich the integral in Eq. (7.2) does not converge if taken 
over an infinite volume. Such wave functions require special considcra- 
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tion, and will be discussed further in Secs. 10 and 11. For the present, 
we may think of the region of space in which such a wave function is 
defined as being arbitrarily large, but finite; then the integral in Eq. (7.2) 
is over the finite volume of this region and converges, so that normaliza¬ 
tion is always possible. 

The coefficient of ^ that normalizes it must be independent of the 
time in order that i may satisfy the wave equation (fi.lfi). Thus if 
Eq. (7.2) is satisfied at one instant of time, the interpretation of |f| 2 as a 
position probability density requires that the normalization integral be 
independent of the time. That this is actually the ease may be shown by 
computing the time derivative of the integral of P over any fixed volume 
V. 

= ^ f r [*VV- (V W] dr 

ih f 

= 2m j v dlv & grad * “ ^ rad 
ih f 

= 2 m j A W e rad $ ~ (e rad #) '/']’■ dA. 

Here substitution has been made for d^/dt from Eq. (G. 1 (5), and for dijz/dt 
from the complex conjugate of Eq. ((>.10). The last integral is obtained 
by partial integration (use of Green’s theorem), where A is the bounding 
surface of the region of integration and [ ]„ denotes the component of 

the vector in brackets in the direction of the outward normal to the sur¬ 
face element dA . 1 

We define a vector S(r,/) t 

h 

S 0V') = 2im ^ grad ^ ~ fe rad (7.3) 

in terms of which 

J v I J (ht)dT = - Hv S dr = - j A XJA (7.4) 

In the case of a wave packet, for which \p vanishes at great distances and 
the normalization integral converges, the surface integral is evidently 
zero when V is the entire space. For a wave function of the type given 
in Eq. (0.10), \p can be defined in a finite region V so that it vanishes or 
has a periodic structure along the bounding surfaces (see Sec. 10). In all 
these cases, it can be shown without difficulty that the surface integral in 

1 It. is convenient to adopt an order of factors such that i p precedes t/' [nee the dis¬ 
cussion of Eq. (7.7) below|. 
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Eq. (7.4) is zero, bo that the normalization, integral in Eq. (7.2) 1H l ' on - 
stant in time. . 

Probability Current Density. The derivation of Eq- (7.4) also shows 
that the differential relation 

+ div S(r,t) = 0 

at 

is valid. This has the familiar form associated with the consol v.ttion 
of flow of a fluid of density P and current density S, in which thorn aro no 
sources or sinks. It is thus reasonable to interpret S(r } t) given 1 \V hq. 
(7.3) as a probability current density . This interpretation more 

plausible the identification of —ik grad with the momentum in K<1 * 0^3), 
even when a force is present. For then ( h/im ) grad is the ‘velocity 
operator, and it is apparent that 

S(r,£) = real part of grad & j 

While this interpretation of S is suggestive, it must be realize* 1 1 luit S 
is not susceptible to direct measurement in the sense in which /' i*. It 
would be misleading, for example, to say that S(r ,t) is the avera*?<* nu^us- 
ured particle flux at the point r and the time t, for a measurement ut 
average local flux implies simultaneous high-precision measuremc mis of 
position and velocity (which is equivalent to momentum) and is iltc'irfom 
inconsistent with the uncertainty relation (3.1). Nevcrthclouf*, it is 
sometimes helpful to think of S as a flux vector, especially when it- 
depends only slightly or not at all on r, so that an accurate velocity 
determination can be made without impairing the usefulness of the 
concept of flux. 

Expectation Value. The existence of the position probability < tensity 
P(t ,£) makes it possible to calculate what we shall call the c.rpt'rfnlian 
value of the position vector of a particle, which is defined as the* vector 
whose components are the weighted averages of the corresponding com¬ 
ponents of the position of the particle. The expectation value* is (lie 
mathematical expectation (in the sense of probability theory) for the 
result of a single measurement, or it is the average of the result s of a 
large number of measurements on independent systems of the t^po dis¬ 
cussed at the beginning of this section. We write the expectation value 
of r as 

<r) = /rP(r,0dr = /^(r,i)r^(r,«)rfr (7.!») 

which is equivalent to the three equations 

(x) = Wzl/dr, (y) = j^yipdr, (z> = J^z^dr 
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where \p is normalized. The expectation value is a function only of the 
time, since \p and P depend on t and the space coordinates have been 
integrated out. 

The expectation values of any other physically meaningful quantities 
can be found in a similar way if they are functions only of the particle 
coordinate r. Thus the expectation value of the potential energy is 

<F> = iV(r,t)P(v,t)dr = tt(i,t)V(TM(.r,t)dr (7.6) 

A quantity such as momentum or energy must, however, be expressed in 
terms of r and t before a calculation of this type can be made. We assume 
that it is possible to use the differential-operator representations given in 
Eq. (6.13) for this purpose, and justify this assumption with the help 
of the corresponding probability functions in Sec. 10 (for the energy) 
and in Sec. 11 (for the momentum). The question immediately arises, 
however, as to how such differential operators are to be combined with 
the position probability density P. 

This question may be answered by imposing on the expectation values 
the reasonable requirement that 

< e >=< 0 +(f> 

in analogy with the classical energy equation (6.15). In terms of dif¬ 
ferential operators, this may be written 

<*' 4 >-<-» v, > + < y> a7 ' 

It is apparent that Eq. (7.7) is consistent with the wave equation (6.16) 
only if the expectation value is defined in the general case with the 
operator acting on \£, and multiplied on the left by We therefore 
obtain, for example, 

(E) = J $ih ~ dr, (p) = J grad (7.8) 

Like Eq. (7.5), the second of Eqs. (7.8) is equivalent, to the three com¬ 
ponent equations 

(p x ) = -ih J <p„> = —ih J t'-^dr 

(v*) = -ih J 

Ehrenfest’s Theorem. 1 It is reasonable to expect the motion of a 
wave packet to agree with the motion of the corresponding classical 
1 P. Ehrenfest, Zeita. f. Phyaik , 46, 455 (1027). 
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particle whenever the potential energy changes by a negligible amount 
over the dimensions of the packet. If we mean by the 11 position” 
and “momentum” vectors of the packet the weighted averages or expec¬ 
tation values of these quantities, we can show that the classical and 
quantum motions always agree. A component of the “velocity” of 
the packet will be the time rate of change of the expectation value of 
that component of the position; since (x) depends only on the time, and 
the x in the integrand of Eq. (7.5) is a variable of integration, this is 

= ^ dr = j * x Tt dT + J ft ** Ar 


This may be simplified by substituting for the time derivatives of the 
wave function and its complex conjugate from Eq. (6.16) and canceling 
the V terms. 

- /(- Vf)xWr 

= 5ih f — (V 2 tf)^]rfr 

The second integral can be integrated by parts: 

J (V^xipdr = — j (grad ip) • grad ( x\p)dr + j A Or^grad #)»hr. dA 

where the integral of the normal component of x\p grad ip over the infinite 
bounding surface A is zero because a wave packet $ vanishes at great 
distances. A second partial integration, in which the surface integral 
again vanishes, results in 


Thus 


J (V 2 ip)x\pdr = fipV 2 (xip)dT 


A ( x ) = [ 

dt { ' 2 m J 


ip[xVhp — V 2 (x\p)]dr 


m ] r dx 


(7.9) 


Since (#) is seen always to be a real number from the structure of Eq. 
(7.5), Eq. (7.9) shows quite incidentally that (p x ) is real; this can also be 
shown from the second of Eqs. (7.8) when ^ represents a wave packet, 
by means of partial integration. 
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In similar fashion we can calculate the time rate of change of a com¬ 
ponent of the “momentum” of the particle as 



again substituting from the wave equation and integrating twice by parts. 

Equations (7.9) and (7.10), together with their other components, are 
analogous to the classical equations of motion, 


dx _ £ 
dt ~ m 


Tt = ~ grad 7 


They provide an example of the correspondence principle, since they 
show that a wave packet moves like a classical particle whenever the 
expectation value gives a good representation of the classical variable; 
this is usually the macroscopic limit in which the finite size and the internal 
structure of the packet can be ignored. 


8. ENERGY EIGENFUNCTIONS 

The Schrodinger wave equation ((>.10) admits of considerable sim¬ 
plification when the potential energy V{x) does not depend on the time. 
It is then possible to express its general solution as a sum of products of 
functions of r and t separately. 

Separation of the Wave Equation. We consider a particular solution 
of Eq. (0.10) that can be written as a product: \ A(r,£) = ?i(r)/(/); a general 
solution (win then be written as a sum of such separated solutions. If we 
substitute into Eq. (0.10) and divide through by the product, we obtain 

7 + »') 

Since the left side of Eq. (8.1) depends only on t and the right side only 
on r, both sides must be equal to the same separation constant, which we 
call E. Then the equation for/ is readily integrated to give 

i 'El 

m = <v * 
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where C is an arbitrary constant, and the equation for u becomes 

[ ~ M V * + F(r) ] u(r) = Eu(r) M 

Since Eq. (8.2) is homogeneous in u , the constant C may be chosen to 
normalize u. Then a particular solution of the wave equation is 

_ iBt 

= u{i)e (8.3) 

Significance of the Separation Constant E. The time-derivative 
operator given in Eq. (6.13) as a representation of the total energy may 
be applied to the p of Eq. (8.3) to give 

ih ^ = Ep (8.4) 

An equation of the type of Eq. (8.4) is called an eigenvalue equation; \J/ 
is said to be an eigenfunction of the operator that appears on the left, 
and the multiplying constant E that appears on the right is called the 
corresponding eigenvalue. 1 An energy eigenfunction, like the \p in Eq. 
(8.3), is said to represent a stationary state of the particle, since |^| 2 is 
constant in time. 

Equation (8.2) is also an eigenvalue equation. It states that u (and 
hence also ^) is an eigenfunction of the operator [— (A‘V2ra)V 2 _|_ F(r)] 
with the same eigenvalue E. It is, of course, to be expected that is 
an eigenfunction of this operator if it is an eigenfunction of the time- 
derivative operator, since according to the wave equation (G.1G), the two 
operators are equivalent not only for separated functions of the form of 
Eq. (8.3) but also for more general solutions. 

We now anticipate the discussion of the physical significance of 
eigenfunctions and eigenvalues that will be presented in Chap. Ill, and 
assume that the energy eigenvalues E are the only possible results of 
precise measurements of the total energy of the particle. It is then of 
interest to inquire whether or not physically interesting solutions u{ r) 
of Eq. (8.2) exist for all real values of E. An answer cannot be obtained 
until a specification of “physical interest” is found in terms of the bound¬ 
ary conditions that are imposed on u(r). This specification, and the 
general character of the energy eigenvalues that are associated with 
various types of potential energy function F(r), are considered in the 
remainder of this section. 

1 The terms characteristic function and characteristic value are often used in place of 
eigenfunction and eigenvalue. 
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Boundary Conditions at Great Distances. We have thus far encoun¬ 
tered two classes of wave functions: wave packets that are well localized 
and for which the normalization integral J\^\ 2 dr converges, and traveling 
harmonic waves like the function (6.10) that have a constant magnitude 
at great distances so that the normalization integral taken over an 
infinite volume diverges. The first class may be interpreted as represent¬ 
ing particles that, if free, are initially well localized, or that are restrained 
to a particular region of space by external forces derived from the poten¬ 
tial energy V(r ). The second class represents particles that are neither 
localized nor restrained, but travel through the region under consideration 
from one distant part of space to another; such wave functions will be 
useful in describing the scattering of particles by a field of force. 1 In 
either case, the wave functions are bounded at great distances in all 
directions. 

Continuity Conditions. The time-independent wave equation (8.2) is 
a second-order linear differential equation in r. Thus so long as V(r ) is 
finite, whether or not it is continuous, a knowledge of the wave function 
and its gradient along a surface makes it possible to integrate the equa¬ 
tion to obtain the wave function at any point. It is reasonable, therefore, 
to require that the wave function and its gradient be continuous, finite, 
and single-valued at every point in space, in order that a definite physical 
situation can be represented uniquely by a wave function. These 
requirements also have the consequence that the position probability 
density P(i) and the probability current density S(r), defined in Sec. 7, 
are finite and continuous everywhere. 

Boundary Conditions for Infinite Potential Energy. If F(r) is infinite 
anywhere, the appropriate boundary condition can be established by a 
limiting process that starts from a finite F and the above continuity 
conditions. 

Suppose, for example, that there is an infinite discontinuity in V 
across a continuous surface, so that the potential energy is finite on one 
side of it and + °o on the other, and we wish to determine the boundary 
conditions on u( r) and grad u at this surface. The essential features of 
the problem are retained if we replace the continuous surface by the 
plane that is tangent to it at the point of interest, and the continuously 
changing potential energy on one side of the surface by a constant poten¬ 
tial, which can, without loss of generality, be chosen to be zero since any 
constant change in V is equivalent to an equal change in E. We choose 

1 Another possible class consists of wave functions that become infinite at large 
distances; however, these are not of physical interest, since wo have no reason to be 
concerned with particles for which the position probability density becomes inde¬ 
finitely large in remote regions of space. 
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the origin of coordinates at the point of interest and the x axis perpendicu¬ 
lar to the tangent plane. 

The wave equation (8.2) then separates in the three space coordinates, 
and the dependence of u on y and z is not affected by the discontinuity in 
V at the plane x = 0. We wish therefore to solve the one-dimensional 
wave equation 

~w + v ^ u = Eu (8 - 5) 

where V(x) =0 for x < 0, V(x) = Vo for x > 0, and we eventually 
pass to the limit V Q —> + co. If we assume that 0 ^ E < V 0y the general 
solutions of Eq. (8.5) are 

u(x) ® A sin ax + B cos ax, x < 0, a = + 

u(x) = Ce-t* + De?*, x > 0, ft = + —— j 

The boundary condition that u be bounded at great distances requires 
that we set D = 0. Then the continuity of u at x = 0 gives the relation 
B = C, while the continuity of du/dx gives the relation a A = —$C. 
Since p becomes infinite when Vo does, and the solution for x < 0 must 
be finite, the second relation shows that C becomes zero as F 0 —> 
thus also making B zero; A is not determined from these relations, but 
might be fixed by normalization. 

Thus the boundary conditions at a surface at which there is an infinite 
potential step are that the wave function is zero and the component of 
the gradient of the wave function normal to the surface is not determined. 
The assumption above that E < Vo is evidently not a restriction since 
Vo eventually becomes infinite. For E < 0, the sine and cosine in the 
solution for x < 0 are replaced by hyperbolic sine and cosine (which is 
permissible since the solution need hold only near x = 0), with no change 
in the final result. It should be noted that both P and S z vanish as 
x —» 0 from the negative side, so that they are continuous at x = 0 even 
though du/dx is not. 

A boundary surface of this type represents a perfectly rigid, impene¬ 
trable wall, since in the analogous classical situation a particle of any 
finite energy would have its x component of momentum reversed instan¬ 
taneously on contact with the surface. 

Energy Eigenvalues in One Dimension. Energy eigenfunctions that 
represent particles that are restrained to a particular region of space by 
the potential energy (first class) are always characterized by discrete 
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eigenvalues, while eigenfunctions that do not vanish at great distances 
(second class) possess a continuous range of eigenvalues. This may¬ 
be seen qualitatively by considering the nature of the solutions of the one¬ 
dimensional wave equation (8.5). 

We shall suppose at first that V(x) becomes equal to some constant 
value, which may be taken to be zero, for sufficiently large positive and 
negative x, and that E < 0. A classical particle with this total energy E 
cannot escape to infinity, and indeed can exist in the region only if E is 
greater than or equal to the smallest value Vmm of V(x). The permitted 
form of the wave function for \x\ large enough that V = 0 is evidently 
e~P {x \ where /3 = +( — 2mE/h 2 )K These two solutions for large positive 
and negative x can be extended in toward some intermediate point, say 
x = 0, by making use of the wave equation and the continuity conditions. 
At this point, u can always be made continuous by choosing the arbitrary 
multiplying constants for the positive and negative x solutions appropri¬ 
ately. Then a little reflection shows that for an arbitrary value of E , 
they will not meet with the same value of du/dx. There may, however, 
be particular values of E for which both u and du/dx are continuous at 
x = 0. The conditions for this can be seen in the following way. 

In the regions in which E < V(x), d 2 u/dx 2 is positive, and so u is 
convex towards the x axis. Thus the two solutions that are continued in 
from + oo have opposite signs for their ratios of slope to value, 

l du 

u dx 

so long as they are in regions for which E is always less than V. This is 
illustrated in Fig. 56 for the potential shown in Fig. 5 a] both choices of the 
sign of u are shown for x < 0. The points at which E = V (x) are called 
the turning points (TP) of the classical motion, since they are the limits of 
the motion of a classical particle of energy E } at which the particle turns 
around or reverses its motion. At these points d'hi/dx 2, = 0, and u has 
zero curvature. 

Clearly what is needed to make the two solutions join smoothly is a 
region in which E > F(aO, in order that dhi/dx 2 may be negative and u 
be concave toward the x axis; this may permit the ratios of slope to value 
to become equal. Figure 5c shows the two solutions carried in until they 
meet, but for a somewhat small value of E } so that when the id s are made 
the same at x = 0, the slopes are not equal (solid curves), or when the 
slopes arc made the same at x = 0, the u f s are not equal (dotted curve on 
left and solid curve on right). Figure 5 d shows a somewhat larger (less 
negative) value of E } and Fig. 5c a still larger value. The values of E 
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and Vaia are indicated on the u axis, and the turning points are indicated 
on the x axis, in the last three cases. 

Discrete Energy Levels. We see then that an eigenfunction that 
satisfies the boundary and continuity conditions and that represents a 
particle bound by the potential energy 7(z), can exist for the particular 





u(x) 




E fP X 

(e) 

V • 
m/n. 


Fig. 5. (a) Potential energy function and ( b) solutions for large |rc|; (c) and (e) show that 

either the wave function or its slope is discontinuous at x ~ 0 for values of E that are 
smaller (more negative) and larger, respectively, than the energy eigenvalue shown in (d). 

value of E illustrated in Fig. 5 d. In analogy with the classical situation, 
a necessary condition that such an eigenfunction exist is that 7mm < 0, 
in which case E lies between 7 m m and 0; as in the classical case, this con¬ 
dition is also sufficient in one dimension although it is not in three dimen¬ 
sions (see Prob. 10, Chap. IV, and Secs. 9 and 15). 

If the potential energy well illustrated in Fig. 5 a is sufficiently broad 
or deep, there will exist another eigenfunction corresponding to a larger 
energy eigenvalue that is still negative. Figures 6a, 6, and c show a series 
of wave functions analogous to those shown in Figs. 5c, d, and e for suc¬ 
cessively increasing (successively less negative) values of E ; both signs 



Sf,c. 8] 


I HE SCHROdiNQBR WAVE EQUATION 


33 


of « ore Bhown tor * < 0. Thus Figs. 5 d and 6 b show the eigenfunctions 
for the two lowest energy e lg envalues or energy levels of a particle bonZ 
by the potential well T' (,r). It M easy to see by an extension, of the fore- 
gomg qualitative arguments that if there are any higher discrete energy 
levels, each cgoulunotum has one more node than that corresponding to 
the next lowest eigenvalue. 

thus ior a potential energy that approaches a finite constant value as 
x > ± 00 , thei e may bo a finite number of discrete energy levels, or in 





Fio. 0. Solutions for sufficient,ly broad or deep potential and larger (less negative) values 
of E than those shown in Kig. 5. E increases in going from (a) to (6) to (c) and is an 
eigenvalue in (/;), whom the wave function and its Blope can both be continuous at a; =0. 

some eases an iiifinitc* number (if V(x) falls off slowly enough for large 
\x\), depending on 7(.*0 and the mass of the particle. However, if 
V(x) —:> + co as x —> ± 03 , an argument like that given above shows that 
there will always be an infinite number of discrete energy levels; apart 
from arbitrary multiplying constants there will be just one eigenfunction 
u(x) for each of these. 

Continuous Energy Eigenvalues. It is possible to find eigenfunctions 
that obey the boundary and continuity conditions for all-energy eigen¬ 
values (hat, exceed the smaller of the two numbers 7(+oo) and F(— co). 
If, for example, the potential energy has the form illustrated in Fig. 5a, 
then solutions of Urn wave equation can be found for all positive values 
of E . This is because the solutions for large \x\ are of the form 

A sin a|:r| + B cos a\x\, a — + 



( 8 . 6 ) 
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and there is no reason why both terms should not be kept. Thus it is 
always possible to adjust the phase of each of the wave functions for large 
\x\ (which is equivalent to adjusting the ratios A/B for the solutions for 
large positive and negative x ) so that they join together smoothly when 
continued in to x = 0. 

The classical terms periodic (or multiply periodic) and aperiodic are 
sometimes used to designate the particle motions associated with discrete 
and continuous energy eigenvalues, respectively. 

Discrete and Continuous Eigenvalues in Three Dimensions. We 
shall assume without further discussion that all the foregoing results can 




Fig. 7. One-dimensional square well potential with (a) perfectly rigid walls and ( b ) finite 
potential step. 

be taken over in a natural way for the three-dimensional wave equation 
(8.2). We can expect that if F(r) —> + oo as r oo in all directions, 
there will be an infinite set of discrete energy levels extending to + °°. 
If V (r) is bounded as r —» co in some direction, there may be a finite or an 
infinite number of discrete levels, depending on the form of V. In this 
case, the discrete energy levels cannot exceed the smallest value that 
V(cc) has in any direction. For values of E larger than this smallest 
F(o°), the energy eigenvalues ^cover a continuous range extending to 
+ °°. 

9. ONE-DIMENSIONAL SQUARE WELL POTENTIAL 

As a simple explicit example of the calculation of discrete energy levels 
of a particle in quantum mechanics, we consider the one-dimensional 
motion of a particle that is restrained by reflecting walls that terminate a 
region of constant potential energy. Two simple types of potential 
energy are considered. Figure 7a shows a situation in which V(x) = 0 
for —a < x < a, and V{x) = +<*> for \x\ > a, corresponding to per¬ 
fectly rigid, impenetrable walls at the points x = ±a. In Fig. 7b the 
increase in potential energy at the walls is abrupt, but finite, so that 
V(x) = Fo for \x\ > a; because of its appearance, this is often called a 
square well potential . The motion of a classical particle with total energy 
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E less than Vo is the same for both these potentials; but as we shall see, 
the quantum-mechanical behavior is different. In general, an abrupt 
finite increase in potential energy at the boundaries of a region forces a 
particle toward the interior of the region. Such a potential may be 
thought of as a limiting case of a potential of the type shown in Fig. 5a, for 
which the force —dV/dx is always directed in toward x — 0. The force 
associated with a square well potential is zero except at the boundaries, 
so that the particle is acted on by no force except a sudden impulse 
directed toward the origin as it passes the points x = ± a. 

Perfectly Rigid Walls. It was shown in Sec. 8 that the wave function 
must vanish at the points x = ±a, when the potential energy has the 
form shown in Fig. 7a. From Eq. (8.5) the wave equation for \x\ < a 
is simply 


h 2 d?u 

2m dx* ~ Eu 


(9.1) 


which has the general solution 

u(x) — A sin ax + B cos ax , a = + (9.2) 

Application of the boundary conditions at x = ±a gives 

A sin aa + B cob aa = 0, 

— A sin aa + B cos aa = 0, 

from which wo obtain 

A sin aa = 0, B cos aa = 0 

Now we do not want both A. and B to be zero, since this would give 
the physically uninteresting solution u = 0 everywhere. Also, we cannot 
make both sin aa and cos aa zero for a given value of a or E . There arc 
then two possible classes of solutions: For the first class 

A = 0 and cos aa = 0 

and for the second class 

B — 0 and sin aa = 0 

Thus aa = rf,7r/2, where n is an odd integer for the first class and an 
even integer for the second class. The two classes of solutions and their 
energy eigenvalues arc then 


/ \ r> I'VkX 

u{x) = B cos - Q — 
Za 

f x . . mrx 

u{x) = A sin n - 
la 

E7 _ ^ 2 ^ 2 n 2 

8 ma 2 


n odd 


n even 


in both cases 
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It is evident that n = 0 gives the physically uninteresting result u = 0, 
and that solutions for negative values of n are not linearly independent 
of those for positive n. The constants A and B can easily be chosen in 
each case so that the eigenfunctions u(x) are normalized. 

There is thus an infinite sequence of discrete energy levels that cor¬ 
respond to all positive integer values of the quantum number n . There 
is just one eigenfunction for each level, and the number of nodes of the 
nth eigenfunction that are within the potential well is n — 1. These 
results are in agreement with the discussion of Sec. 8 . It is interesting to 
note that the order of magnitude of the lowest or ground-state energy 
level is in agreement with the uncertainty relation (3.1). The position 
uncertainty of order a implies a momentum uncertainty at least of order 
ft/a, which in turn implies a minimum kinetic energy of order h*/ma\ 
Finite Potential Step. When the potential energy has the form shown 
in Fig. 7b, it is necessary to supplement the general solution (9.2), which 
is still valid for |a;| < a since Eq. (9.1) is unaltered there, by a solution for 
\x\ > a. The wave equation in this region is 


A 2 d 2 u , T7 „ 
— ~— j—o + V = Eu 
2m dx 2 


which has the general solution for E < Vo (bound states) 

u(x) = Ce-e* + De? x , (3 = + ( 9 . 3 ) 

The boundary conditions at x = ± co discussed in Sec. 8 require that we 
set D = 0 if Eq. (9.3) is to represent the solution for x > a, and C = 0 
if the solution is for x < — a. 


We now impose on the solutions ( 9 . 2 ) and (9.3) the requirements that 


u and du/dx be continuous at x = ± a. 

A sin aa + B cos aa = Ce~ pa , a A cos aa — aB sin aa = — 0 C'e“ /3 “ 

— A sin aa + B cos aa = aA cos aa + aB sin aa = /3 De~P a 

from which we obtain 

2 A sin aa = (C — D)c“^°, 2 aA cos aa = —p(C — D)e~P a (9.4) 

2 B cos aa - (C + D)e~^ a , 2 aB sin aa = 0 (C + D)e“^ (9.5) 

Unless A — 0 and C = D, Eqs. (9.4) have as their consequence 

a cot aa = —/S (9.6) 

Similarly, unless B = 0 and C = — D, Eqs. (9.5) give 

a tan aa = ft (9.7) 
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Now it is impossible for Eqs. (9.6) and (9.7) to be valid at once, since 
on elimination of p this would require that tan 2 aa = — 1 , which in turn 
would make a imaginary and p negative, contrary to Eq. (9.3). Also, we 
do not want A, B, C, and D all to vanish. Thus the solutions may again 
be divided into two classes: For the first class 

A = 0 , C ~ D, and a tan aa = P 
and for the second class 

B = 0 , C = —D, and a cot aa = —p 

Energy Levels. The energy levels are found from a numerical or 
graphical solution of Eqs. (9.6) and (9.7) with the definitions for a and p 



Fig. 8. Graphical Holution of Eq. (9.7) for three valuen of Wi 3 ; vertical dashed lines are 
the firat two asymptotes of 77 = £ tan £. 


given in Eqs. (9.2) and (9.3). A simple graphical method for effecting 
this solution is described here, since it shows quite clearly the way in 
which the number of discrete levels depends on Fo and a. We put 
£ = aa, 77 = j3a, whence Eq. (9.7) becomes £ tan £ = rj, with 


F + v 2 


2 mV Q a 2 
A 2 


Since £ and 77 are restricted to positive values, the energy levels may be 
found in this case from the intersections in the first quadrant of the curve 
of £ tan £ plotted against £, with the circle of known radius ( 2 mF 0 a 2 /A 2 )i. 
The construction is drawn in Fig. 8 for three values of Fou 2 ; for each of 
the two smaller of these values, there is one solution of Eq. (9.7) and for 
the largest there are two. 
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Figure 9 is a similar construction for the solution of Eq. (9.6) in which 
the energy levels are obtained from the intersections of the same circles 
with the curve of — £ cot £ in the first quadrant. The smallest value of 
7oa 2 gives no solution, and the two larger values each give one. Thus the 
three increasing values of V o<x 2 give altogether one, two, and three energy 
levels, respectively. 

It is clear from Figs. 8 and 9 that for a given particle mass, the energy 
levels depend on the parameters of the potential energy through the com¬ 
bination F 0 a 2 . For Fod 2 between zero and ir 2 h 2 /8m, there is just one 



Pio. e. Graphical aolution of Eq. (0.0) for three values of Foo ! ; vertical dashed line is the 
first asymptote of t) — -{cot f. 


energy level of the first class,’ for V od 2 between ir 2 h 2 /8m and four times this 
value, there is one energy level of each class, or two altogether. As F 0 a 2 
increases, energy levels appear successively, first of one class and then of 
the other. It is not difficult to see from Eq. (9.2) that when ordered 
according to increasing eigenvalues, the nth eigenfunction has n — 1 
nodes. 

Parity. It follows from the foregoing discussion that the eigenfunc¬ 
tions of the first class are even with respect to change in sign of x [u(— x) 
= u(x)], while the eigenfunctions of the second class are odd [«(-*) 
= —«(*)]. This division of the eigenfunctions into even and odd types 
is not accidental, and will now be shown to be a direct consequence of the 
fact that the potential energy function V(x) is symmetric about x = 0. 
If we change the sign of x in the wave equation (8.5) 


V dhc(x) 
2m dx 2 


+ V(x)u(x ) = Eu(x) 


(9.8) 


/ 
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h 2 d 2 u(—x) 
2m dx 2 


+ = Eu(—x) 


Then rt(:r) and w( —rr) are solutions of the same wave equation with 
the same eigenvalue E . Unless there are two or more linearly inde¬ 
pendent eigenfunctions that correspond to this energy level, these two 
solutions can differ only by a multiplicative constant: 

w(— x) = eu(x). (9.9) 

Changing the sign of x in Eq. (9.9) gives u(x) = tu(—x). From these 
two equations it follows at once that 

€ 2 = 1 or € = ± 1 


Thus all such eigenfunctions of a symmetric potential are either even or 
odd with respect to changes of sign of x. Such wave functions are said 
to have even or odd 'parity. 

If an eigenvalue has more than one linearly independent eigenfunc¬ 
tion, these eigenfunctions need not have a definite parity: they need not 
be even or odd. However, we can easily see that linear combinations of 
such eigenfunctions can be found such that each has even or odd parity. 
Suppose that an eigenfunction u(x) does not have a definite parity. It 
can always be written 

u{x) = u,{x) + u 0 (x), 


where 7/ 0 (.r) = i[u (.r) + 7.i(—#)] hs even and u„(x) = ifwfa?) — u(—x)] is 
odd. Then if the wave equation (9.8) is symmetric, we can write it as 


h 2 dht„ , n r 
2mdx 2 + 




E)u 0 = 0 (9.10) 


On changing the sign of x in Eq. (9.10), we obtain 


h 2 dhic 
2m 7 Iv 2 


+ {V- E)u t , + 


h 2 d 2 u„ _ (V 
2m tlx 2 K 


E)u„ = 0 (9.11) 


Addition and subtraction of Eqs. (9.10) and (9.11) shows that u„ and u 0 
are separately solutions of the wave equation with the same eigenvalue E . 

A Simplified Solution. Knowledge that the solutions possess a defi¬ 
nite parity sometimes simplifies the determination of the energy levels, 
since then we need only find the solution for positive x. Even solutions 
have zero slope and odd solutions have zero value at x = 0. If, for 
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example, we wish to find the even solutions, Eqs. (9.2) and (9.3) can be 
replaced at once by 

u(x) = B cos ax, 0 < x < a 

u(x) = Ce~P x , x > a 

Instead of making both u and du/dx continuous at x = a, it is enough to 
make the ratio (1 /u){du/dx) continuous at x = a, since the normalizing 
constants B and C are eliminated thereby. This gives Eq. (9.7) at once. 
Similarly, the odd solutions are 

u(x) = A sin ax, 0 < x < a 

u{x) = Ce~ fix , x > a 

Then continuity of (1 /u) (du/dx) at x = a immediately gives Eq. (9.6). 

Problems 

1* Use the arguments of See. 6 to set up a differential equation for ^ that involves 
a second time derivative of in the case of a free particle. Discuss any solutions that 
this equation has that are not Bhared by the free-particle Schrodinger equation. 

2. Show directly that (p a ) is real for a wave packet. 

3. Show that for a three-dimensional wave packet 

I ^ “ m + 

4. Calculate the energy levels and plot the eigenfunctions for the three bound 
states in the potential of Fig. 7b when Voa 2 » 6 fi 2 /m. Compare with the first three 
states for the potential of Fig. 7a. 

5. Discuss the relation between the energy levels for the potential of Fig. 7b and 
those for the potential: V(x) = + x < 0; V{x) = 0, 0 < x < a; V(x) = V Q , 
x > a. 

6. Show that if the potential energy V (r) is changed by a constant amount every¬ 
where, the wave functions are unchanged. What is the effect on the energy 
eigenvalues? 



CHAPTER III 


EIGENFUNCTIONS AND EIGENVALUES 

In Chap. II, the Schrodinger wave equation was developed and 
applied to a simple problem. Some discussion of the physical interpreta¬ 
tion of the wave function was given in See. 7. This relied for the most 
part on the computation of average or expectation values of operators 
that represent various physical quantities. In addition, however, a 
position probability density was introduced. Thus at the present point 
we are in a position to calculate from the wave function any property 
that depends on the spatial distribution of a particle (for example, the 
mean-square deviation of its position from the average), whereas we can 
only calculate average values of operators other than position. Clearly 
what is needed to round out our interpretation of the wave function is a 
means of computing probability functions for operators in general; this 
would incidentally enable us to obtain the expressions for expectation 
values used in Sec. 7 without separate assumptions. 

In this chapter, we first set up three physical postulates from which a 
complete interpretation of the wave function can be derived, and then 
apply them to a discussion of the total energy and momentum of a 
particle, and to an illustrative problem. The interpretation presented in 
Sec. 7 can then be seen to be a special case of the more general treatment 
given here. 

10. INTERPRETATIVE POSTULATES AND ENERGY EIGENFUNCTIONS 

We start with the wave function which is a solution of Eq. 

(6.16) and describes the motion of a particle that has mass m and poten¬ 
tial energy F(r). From this wave function we wish to obtain as complete 
a description of the properties of the motion of the particle as is consistent, 
with the uncertainty relations discussed in Sec. 3. 

Dynamical Variables as Operators. We first postulate that each 
dynamical- variable Hint relates lo the motion of the 'particle can be represented 
by an operator . The operator may be simply a multiplication operator 
such as r for the position, or it may be a differential operator such as 
—ih grad for the momentum. 

With each operator can be associated an eigenvalue equation, defined 

41 
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near the beginning of Sec. 8. Thus with the operator ft may be associ¬ 
ated the equation 

ft%o) = ciyu>o) (10il) 

where u<* is the eigenfunction of ft corresponding to the eigenvalue w. 

Our second postulate is that one or another of the eigenvalues to is the 
only possible result of a precise measurement of the dynamical variable 
represented by ft. This implies that the eigenvalues of all operators that 
represent physically significant variables are real numbers. 

Expansion in Eigenfunctions. We assume that all the eigenfunctions 
of any dynamical variable constitute a complete set of functions in the 
sense that an arbitrary continuous function can be expanded in terms of 
them. This is a mathematical, not a physical, assumption, and mil 
be discussed further below in connection with energy and momentum 
eigenfunctions. 

Suppose now that a particular wave function i p is expanded in terms 
of the eigenfunctions u u of the operator ft. We adopt the statistical inter¬ 
pretation of \p given at the beginning of Sec. 7, according to which there 
are a large number of identical, nonoverlapping regions of space, in 
each of which is a particle described by We then make measurements 
of the dynamical variable represented by ft on each of these particles. 
Our third physical postulate is that the number of measurements that result 
in the eigenvalue u is proportional to the square of the magnitude of the 
coefficient of u* in the expansion of yj/. This enables us to associate a 
probability function with any dynamical variable. A corollary is that 
we are certain to measure a particular eigenvalue a> only when the wave 
function that describes the particle is the corresponding eigenfunction u c *. 

Rather than develop the consequences of these postulates for an 
arbitrary operator ft, we consider here the total energy of the particle, 
and in Sec. 11 the momentum. Most of the results obtained are readily 
applicable to other physical operators. 

The Total Energy Operator. According to the uncertainty relation 
(3.3), a precise measurement of the total energy of a particle cannot be 
made in a finite length of time. Thus if the total energy is to have a 
definite value, it is essential that the potential energy F(r) be inde¬ 
pendent of the time. Then the operator — (/f 2 /2ra)V 2 + T(r), which is 
shown in Sec. 8 to be equivalent to the total energy operator ih d/dt } has 
eigenfunctions u( r) that need not involve the time. The energy-eigen¬ 
value equation is Eq. (8.2), 

[- g^V* + y(r)]t«.(r) = Eub(j) 


(10.2) 
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where the eigenfunction uk{ r) corresponds to the eigenvalue E, and obeys 
the boundary and continuity conditions of Sec. 8. 

As discussed in Sec. 8, the energy eigenfunctions can be divided into 
two classes: those which are well localized and are associated with discrete 
eigenvalues, and those which remain finite at great distances and possess 
a continuous range of eigenvalues. 

Normalization in a Box. It is often desirable to be able to treat these 
two classes on the same basis; this can be accomplished by enclosing the 
particle under investigation in a box of arbitrarily large but finite volume. 
The simplest physical situation to which this approach is applicable is 
one in which the walls of the box are perfectly rigid, so that, as shown in 
Sec. 8, the wave function vanishes there. Then the discussion of Sec. 8 
shows that all the eigenvalues are discrete. If the box is large in com¬ 
parison with the dimensions of physical interest in the problem, the 
eigenvalues that were discrete in the absence of the box are practically 
unaffected, since before the walls were introduced the wave functions were 
extremely small there. Also, the eigenvalues that were continuously 
distributed in the absence of the box are very closely spaced; this is 
shown explicitly for a free particle in Sec. 11. 

It is more convenient to assume that the wave functions obey periodic 
boundary conditions at the walls of the box than that they vanish there, 
since it is then possible to get a simpler description of the momentum 
eigenfunctions (see Sec. 11). We choose the finite region to be a cube of 
edge length L centered at the origin, and require each wave function 
to have the same value at corresponding points of opposite faces of the 
cube, and the same derivative normal to the wall. These boundary 
conditions make the otherwise continuous eigenvalues discrete, since 
the phase of the eigenfunction at great distances is no longer arbitrary 
[see the discussion of 15q. (8.0)]. As with the rigid-walled box, the pres¬ 
ence of the walls has a negligible effect apart from imparting discreteness 
to the otherwise continuous eigenvalues and providing a finite volume in 
which these wave func(ions can be normalized; we shall continue to use 
the word “continuous” to describe these functions, even when box 
normalization is used. 

Orthonormality of Energy Eigenfunctions. The integral S\u e (t)\Wt, 
which converges in any case for the discrete set of eigenfunctions, con¬ 
verges for all eigenfunctions whim they are normalized in the box of 
finite volume I A The coefficient of u s can then be chosen so that this 
integral is equal to unity, and the uk{t) are normalized. 

We now show that the eigenfunctions corresponding to two different 
eigenvalues E and E ' are orthogonal; that is, that the integral of the 
product of one of them and the complex conjugate of the other over the 
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common domain of the functions is zero. From Eq. (10.2) we have that 
wjj'(r) satisfies the equation 

[ ~ £ V2 + 7(r) ] “* ,(r) = E ' UB,{t) (10,3) 

where we assume in accordance with our physical interpretation that E' 
is real; this is verified below. We multiply Eq. (10.2) by u S f and Eq. 
(10.3) by Us } integrate over the volume L B , and take the difference 
between the two resulting equations. The V terms cancel, and leave 

— ^ J (us*V 2 ub — UBVHiB f )dr = (E — E r ) J u^Usdr (10.4) 

The integral on the left side of Eq. (10.4) can be transformed by 
Green's theorem into a surface integral over the surface A of the cube. 
J (‘Ub i V 2 ub — UsVHiB')dT 


( 

L 


div ( u s ' grad u B — u B grad u E r )dr 
{titf grad u s — u E grad u E ') n dA 


(10.5) 


where the subscript n designates the component of the vector in the 
direction of the outward normal to the element of area dA. Since the 
imposition of periodic boundary conditions gives each wave function and 
its normal derivative the same values at corresponding points of opposite 
faces of the cube, the outward normal derivative has opposite signs on 
opposite faces, and the surface integral in Eq. (10.5) vanishes. Then 
Eq. (10.4) tells us that if E jA Eu E and u E f are orthogonal. 1 

An energy eigenvalue E is said to be degenerate when two or more lin¬ 
early independent eigenfunctions Wi, w 2 , . . . correspond to it. Orthog¬ 
onal linear combinations of degenerate eigenfunctions can be found in 
many different ways. For example, u a = a x ui + a^u 2 can be made 
orthogonal to Ui by choosing the constant coefficients a x and a 2 such that 


Oi juiUidr 

a 2 J\ui\ 2 dr 

this choice does not interfere with the normalization of u a , and u a is still 
an energy eigenfunction with the eigenvalue E . Evidently the choice 
1 It iB apparent that this proof of orthogonality can be applied to a discrete set of 
eigenfunctions even though the box is not introduced, since the w's vanish rapidly at 
great distances and the surface integral, which is then over a sphere of infinite radius, 
is zero. The continuous set of energy eigenfunctions can also be treated without 
using the box normalization (as is the continuous set of momentum eigenfunctions in 
Sec. 11). See E. C. Kemble, “The Fundamental Principles of Quantum Mechanics” 
(McGraw-Hill, New York, 1937); Sec. 30 of Kemble's book presents a discussion of 
this problem and references to the original papers, 
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of orthogonal linear combinations is not unique. By an extension of this 
procedure, all the energy eigenfunctions can be made orthogonal to each 
other even though some of the eigenvalues are degenerate. 

Such a set of eigenfunctions, each of which is normalized and orthog¬ 
onal to each of the others, is called an orthonormal set of functions. We 
specify an orthonormal set of nondegenerate energy eigenfunctions by 
the relation 

Jfc(r)w«(r)r/r = 5*^ (10.0) 

where 5 ek> is the Kronecker fi symbol that equals unity if E = E' and 
is zero otherwise. If there is degeneracy, Eq. (10.6) must be replaced by 

$UE , a'(f)'U']i! 8 (T)dT = 8 hk' fits' (10.7) 

where the index $ distinguishes between orthogonal degenerate eigen¬ 
functions, It is often convenient to omit Explicit mention of s and use 
Eq. (10.6) for degenerate situations as well, in which case the index s is 
implied. 

Reality of the Energy Eigenvalues. We can now see directly that E 
is a real number, as has been assumed. We multiply Eq. (10.2) by u k (t) 
and integrate over the box of volume L 3 . If Uk is normalized, the result is 

E — — ^ J UK^himdr + J V(t)\un\*dT 

which maybe expressed in terms of expectation values as (l/2m)(p 2 ) + { V ). 

The second term ( V) is real since its integrand is real. The first term 
can be shown explicitly to be real by means of a partial integration. 

— J UjiVhiudT = J (grad u K ) • (grad u h )dr — fl*(grad u y ) n dA. 

The volume integral is evidently real, and the surface integral [like that 
in Eq. (10.5)] vanishes because of the periodic boundary conditions at 
the walls of the box. It is interesting to note that (p 2 ) cannot be negative. 

Expansion in Energy Eigenfunctions. As mentioned near the begin¬ 
ning of this section, wo make the mathematical assumption that all the 
eigenfunctions u rs ( r) of the total energy operator constitute a complete 
set of functions in the sense that an arbitrary continuous function can be 
expanded in terms of them. 1 Then if we have any wave function ^( r ) 
at a particular instant of time that is normalized in the box L 3 and obeys 
periodic boundary conditions at the walls, the assumed existence of the 
expansion 

\p(j) = ^ 'd.A’Wjs(r) 

E 

1 For further discussion see E. C. Kemble, op. cit.y Chap. IV and Sec. 30. 


( 10 . 8 ) 
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makes it possible to find unique coefficients As that do not depend on r. 

The coefficients in the expansion (10.8) can be determined by multi¬ 
plying both sides by ti# and integrating over the box. We assume that 

the order of ^ and J dr can be reversed 1 and obtain 

j ftr(rty(r)dr = Y As j ub'(t)ub(i) dr = V A at*# = As’. (10.9) 


with the help of Eq. (10.6) or (10.7). 

The Closure Property. Substitution of the expression (10.9) for As 
back into Eq. (10.8) gives 

lK r ) = X [ / “*( r ')lK r ')dT'] Unix) 

E L 

which we rearrange to give 

V'OO = j »K r 0 wjj(r')wj(r) j dr f (10.10) 

Since ^(r) is an arbitrary continuous function, Eq. (10.10) implies that the 
bracketed part of the integrand vanishes unless r' = r, since otherwise 
the value of $ at the point r given by Eq. (10.10) would change when the 
values of ^ at other points r' ^ r are changed, and this is contrary to 
the assumption that ^ can have an arbitrary form. Moreover, the 
integral of the term in brackets must be unity when the volume of inte¬ 
gration includes the point r 7 = r. We conclude therefore that 


^ ue(i')u e (i) =0, r' 7^ r; 

( 10 . 11 ) 


if the volume of integration includes the point r' = r. 

Equations (10.11) describe the closure property of the orthonormal 
functions Us( r) and are seen to follow directly from their completeness as 
expressed by Eq. (10.8), whether or not they happen to be energy 
eigenfunctions. 

Probability Function and Expectation Value. The second and third 
physical postulates presented at the beginning of this section state that 
the energy eigenvalues are the only possible results of precise measure- 

1 The propriety of changing the order of summations and integrations must in 
principle be investigated separately in each case. The mathematical considerations 
entailed are beyond the Bcope of this book, and we shall always assume that such 
interchanges are permissible in situations of physical interest. 
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ment of the total energy, and that the probability of finding a particular 
value E when the particle is described by the wave function ^(r) is 
proportional to \A E \ 2 . It is easily seen that the proportionality factor is 
unity, for if we put for the energy probability function 

P(E) = \Ae \ 2 (10.12) 

we see that P(E) sums to unity. 

^ P(E) = ^ / M r )^( r )dT / U K (T')$(T')dT' 

E E 

= j j ^(r')^(r) ^ w fi (r)ui(r')] drdr ' 

= J \t(T)\*dr = 1 

since ^ is normalized; use has been made here of Eqs. (10.11). 

We can also compute the average or expectation value of the energy 
from the probability function. 

(E) = £ EP(E) = ^ / Eu„(i)t(T)dT j Mjt ( r'Wr'W (10.13) 

E E 

If we substitute for Eu K from Eq. (10.3), the first integral in Eq. (10.13) 
can be integrated twice by parts as follows: 

j Eu«(r)*(r)dr = j Ut) [“ ^ V* + F(r)] u K {v)dr 

= f u R ( r) ^ V* + F(r)] Hr)dr 

The two surface integrals that result from the partial integrations vanish 
because of the periodic boundary conditions on uh and \f/. Thus with the 
help of Eijs. (10.1 1), Eq. (10.13) becomes 

(E) = ^ / Mr) [ - ^ v* + F(r)]*(r)rfr j u, t (i’)Hr’)dr' 

E 

= JI Hr') {[- + F(r) j f(r)J u,<r)u K (r')] drdr’ 

E 

= / Hr)[~ ^-+ V(r)]nr)dr (10.14) 

The result embodied in Eq. (10.14) confirms the supposition made in 
Sec. 7 that the expectation value of an operator is to be calculated by 
inserting the operator between i£(r) and ^(r), so that it operates just on 
the latter, and integrating over r. 
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General Solution of the Schrodinger Equation. If the potential 
energy 7(r) is independent of t and we know the solution of the Schr&d- 
inger equation (6.16) at a particular time, we can write down a formal 
expression for the solution at any time. We expand ^(r,£) in energy 
eigenfunctions at the time t, in which case the expansion coefficients 
depend on the time. 

t } t) = ^ AE{t)u E {r) y A s (t) = J un(r)\l/(Xyt)dr (10.16) 

Substitution of Eq. (10.15) into the wave equation (G.1G) gives 


ih 



l 


A R (t)Eu R (T) 


(10.16) 


Because of the orthonormality of the u R} Eq. (10.16) is equivalent to 


which may be integrated at once to give 


iJS(t — to ) 

As{t) — A R (to)e ^ (10.17) 

Note that P{E) = |A fi (Z)| 2 = | J 4^(i 0 )| 2 is Constantin time. 

Thus if \j/(r } t) is known at the time t = £ 0 , the solution at any time t 
is given in terms of Eqs. (10.15) and (10.17): 


or 


iE(t-to) 

vHr,i) = > A E (t 0 )e * u E ( r) (10.18) 

A B (to) = / Wre(r')V'(r' ,ta)dr' 

p r VA iE{t — I q) __ 

Mr,t) = j h \MI , ,to)dT , (10.19) 


The solution (10.18) is a linear combination of the separated solutions 
(8.3) obtained earlier. 


11. MOMENTUM EIGENFUNCTIONS 

The eigenfunctions of the linear momentum operator — ih grad pro¬ 
vide a second instructive example of the application of the general ideas 
developed at the beginning of the last section. They also are of con¬ 
siderable usefulness in solving problems of physical interest. 

Form of the Eigenfunctions. The momentum eigenfunctions are 
solutions of the three eigenvalue equations 

-ih grad -u B (r) = pUp(r) 


( 11 . 1 ) 
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or: 


d d 

- *» ^ t*p(r) = p»M P (r), -ift ^ Up(r) 


They have the form 


- ih-^u p(r) = p z u p(r) 


Mp(r) = C exp 


p„Mp(r) 


where C is a normalization constant. 

It is convenient, as in Sec. 0, to change from the momentum vec¬ 
tor p to the propagation vector k — p/ft, and rewrite the momentum 
eigenfunctions 

U]t(x) = C exp {k • r (11.2) 

These are eigenfunctions of the momentum operator with the eigenvalues 
ftk. 

Box Normalization. As with the energy eigenfunctions discussed in 
Sec. 10, we can rcstrieli the domain of the //-k(r) to an arbitrarily large but 
finite cubical box of volume IJ centered at the origin, at the walls of 
which the functions obey periodic boundary conditions. Then u k is 
normalized if C = Also, k is no longer an arbitrary real vector; 

its components are restricted to the values 


. 27 m x , 2 m v 2wn z 

— “ 7 " > A',/ = — 7 — > h' z = —f— 

Jj Jj 


(H.3) 


where n x , n Vi and n z are positive or negative integers or zero. The spac¬ 
ing of neighboring k vectors and of their energy eigenvalues ft a k 2 /2m can 
be made as small as desired by making L sufficiently large. 

It is interesting to note that the momentum eigenfunctions (11.2) 
cannot exist within a box that has perfectly rigid walls, since these eigen¬ 
functions do not vanish anywhere. This is analogous to the classical 
situation in which the momentum of a particle that is reflected from a 
rigid wall is not conserved. On the other hand, the cubical box with 
periodic boundary conditions is equivalent to a situation in which the 
entire infinite space is divided up into adjacent cubes and all wave func¬ 
tions are periodic throughout space with the period L along each of the 
three cartesian axes. If the periodicity of the space is carried over to 
the analogous classical situation, a particle passing through a wall would 
be equivalent to one that strikes that wall and appeal’s at the correspond¬ 
ing point of the opposite wall with its momentum vector unchanged. 

The orthonormality of the momentum eigenfunctions 


^k(r) = lr 3 exp ?!k • r 


(11.4) 
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is readily established. For integration over the volume L 3 

j u\(t)u^(i)dr ~ jj J e i(kx ~ lx)x dx J e i(ky ~~ lv)v dy J e i{h >~ lt)t dz 

— h x ijk v iyh t i t — Ski (11.5) 

where the $'s are Kronecker 5 symbols and use is made of Eqs. (11.3). 
Orthogonality could also have been shown by the more general method 
used in Sec. 10 for the energy eigenfunctions [see Eq. (10.4)]. 

The Dirac 5 Function. It was stated in Sec. 10 that continuous sets of 
eigenfunctions can be handled without introducing the box with periodic 
boundary conditions (which has the effect of making the set discrete with 
an arbitrarily small spacing of eigenvalues). This can be shown explicitly 
for the momentum eigenfunctions with the help of the Dirac S function, 1 
which can be defined by the relations 

5(x) = 0 if x 7*0, S8(x) dx = 1 (11.6) 

where the region of integration includes the point x = 0. An equivalent 
definition is that for an arbitrary function f(x) that is continuous at 
x = 0, the equation 

[f(x)8(x)dx = /(0) (11.7) 

is valid, where again the integration includes the point x = 0. 

It is apparent from a comparison of Eqs. (11.6) and (10.11), or of 
Eqs. (11.7) and (10.10), that the bracketed quantity in Eq. (10.10) can 
be expressed in terms of 8 functions: 

^ u s (r')u s (r) = 8(x — %')8{y - y’)8{z - z ') = 5(r — r') (11.8) 

Comparison of Eqs. (11.8) and (10.6) shows that the closure property 
is a kind of orthonormality of the eigenfunctions with respect to summa¬ 
tion over the eigenvalues. 

A Representation of the 8 Function. The definition (11.6) or (11.7) 
shows that 8{x) is an exceedingly singular function. It may be thought 
of qualitatively as being zero everywhere except at x = 0, and being so 
large there that the area between it and the x axis is finite and equal to 
unity. More formally, it can be represented in a number of different 
ways as the limit of a sequence of analytic functions. 

A particular representation that is quite useful involves (sin gx)/TX 
as a function of x, where g is a positive real number. This has the value 
q/t at x =* 0, oscillates with decreasing amplitude and with period 2tt jg 
as \x\ increases, and has unit integral from x = — oc to x ~ + 00 indepen- 

1 P. A. M. Dirac, “The Principles of Quantum Mechanics,” 2d ed., Secs. 20, 21 
(Oxford, New York, 1935). 
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dently of the value of g. Thus the limit of this function as g —» oo has all 
the properties of the 8 function: it becomes infinitely large at x = 0, 
it has unit integral, and the infinitely rapid oscillations as \x\ increases 
mean that the entire contribution to an integral containing this function 
comes from the neighborhood of x = 0. We can therefore put 


8(x) = lim 
u—* 80 


sin gx 

TTX 


( 11 . 9 ) 


Normalization in Terms of the 8 Function. The representation (11.9) 
of the 5 function can be used to set up an orthonormality integral like that 
given in Eq. (11.5), where now we do not impose the box normalization 
but allow the momentum eigenfunctions to have the form (11.2) over all 
space with all real vectors k. The integral /wi(r)?^(r)rfr is the product of 
three integrals, each of which can be expressed in terms of a 8 function: 



e Wi*-l x )x = I 0 e i{k a —l*)x ^ 

n —> go J 0 


G-* 

= lim 

ff—> OO 


2 sin g(k x — l x ) 


A X 

= 2w8{k x — l x ) 


( 11 . 10 ) 


Thus the momentum eigenfunctions in infinite space can be written 


^k(r) = (8 tt 8 )-* exp ik ■ r (11.11) 

in which case the orthonormality relation becomes 

Jfli(r)ti k (r)rfr = 8(k x - l x )8(k u - l u )8(k s - l s ) = 5 (k - 1) (11.12) 

It will bo shown in Sec. 12 that the box and 5-function normalizations 
of the momentum eigenfunctions give the same final result in a typical 
problem. 

Some Properties of the 5 Function. It is important to note that, 
because of its singular character, the 5 function cannot be the end result, 
of a calculation, and has meaning only so long as a subsequent integration 
over its argument is carried out. With this understanding we can write 
down some relations between 5 functions. 1 


5(.r) = 5(-r) 

5'(.r) = -«'(-*) 

,t5(t) = 0 

x8'(x) = -8(x) (1 1.1 :\) 

8(ax) = a~ l 8(x), a > 0 

5(x 2 — a 2 ) = 2a~ l [8(x — a) + 8(x + a)], a > 0 

J5(a — x)8(x — b)dx = 8(a — b) 

f(x)8(x — a) = f(a)8(x — a) 

Here, a prime denotes differentiation with respect to the argument. 

1 P. A. M. Dirac, op. cit p. 75. 
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Each of the first six of these equations can be established by multiply¬ 
ing both sides by a continuous, differentiable function /(x) and integrating 
over x . For example, the fourth of Eqs. (11.13) gives 

J f(x)x6'(x)dx = - J 8(x) [xf(x)]dx 

= - J5(x)[/(ic) + xf{x)]dx = - Jf(x)5(x)dx 

where the boundary terms that result from the partial integration vanish. 
Thus x5'(x) has the same effect when it is a factor in an integrand as has 
— $(&). Similarly the seventh of Eqs. (11.13) means that the two sides 
give the same result when multiplied by f(a) or f(b) and integrated over a 
or b. The last equation is verified by integrating both sides over either 
x or a. 

Closure. The closure property of the momentum eigenfunctions, 
with both box and 6 function normalization, can be established without 
the help of the completeness assumption that was made in Sec. 10 for 
the energy eigenfunctions. With box normalization, the expression 
analogous to the left side of Eq. (11.8) is 

00 O0 00 

X ^(r'K(r) = L-» XXX g2iri[n*(3J—x / )+»fc v (|/~ y f )+n,(s —\Opt 

This is readily evaluated in the limit of large L , in which case the sum¬ 
mand changes by a negligible amount as each n changes by one unit. 


We can then regard n x as a continuous variable and replace ^ by 

- B f m « 

\ m dn 9 = Ct/Sr) w dk z . We thus obtain 


X Mk(r')«k(r) (87T 8 )- 1 p" e^-^+^^^-^dkJkydk, 

- 5(x - x')d(y - 2/05(2 - 20 = S(r - r 7 ) (11.14) 

on making use of Eq. (11.10). 

A similar calculation can be carried through, using the 5-function 
normalization, in which case we obtain from Eqs. (11.11) and (11.10) 

Ju k (r')ix k (r)dr* = J//tf k (r> k (r) dk x dk y dk z = 5(r - r') (11.15) 


The closure relation (11.14) or (11.15) shows that the momentum 
eigenfunctions are orthonormal with respect to summation or integration 
over the eigenvalue k as well as with respect to integration over the posi¬ 
tion vector r. 
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Expansion in Momentum Eigenfunctions. An arbitrary continuous 
function ^(r) can be written in terms of the S function as 

Hr) = J>(r')5(r - i')dr' (11.16) 

If we substitute the left side of Eq. (11.14) in place of 5(r — r') in Eq. 
(11.16), we obtain 

Hr) = [ Hr') J Uk(r')u k (r)dT' = Y A k u k {r) 

} T T <11.17) 

A k = Juk(r')lA(rOrfr' 

Similarly, if we substitute for 5(r — r') from Eq. (11.15), we obtain 
Hr) = J>(r') /wk (r') «k (i)dr k dr' = fA k u k (r)dr h (11.18) 

with the same expression for A k . Equations (11.17) and (11.18) show 
that it is possible to expand an arbitrary function in momentum eigen¬ 
functions that are normalized either in a box or by means of S functions. 1 

Probability Function and Expectation Value. The momentum 

probability function associated with a normalized wave function ^(r) is 
proportional to |Ak| 2 . The proportionality factor is unity, since if we 
put 

P(k) = |A k | 2 (11.19) 

it is easily shown in analogy with the summation of Eq. (10.12) that 


2 P(k) = 1 and J P(k)dr k = 1 


( 11 . 20 ) 


for the box and 6-funotion normalization, respectively. 

The expectation value of the momentum when box normalization is 
used is 

(p) = h ^kP(k) = J kw k (r)^(r)t/r J (11.21) 

k k 

From the complex conjugate of Eq. (11.1), we can replace kit^r) by 
i grad ^k(r). Then the first integral in Eq. (11.21) can be integrated by 
parts and the surface integral vanishes because of the periodic boundary 
conditions on i p and u k. Thus with the help of Eq. (11.14), Eq. (11.21) 
becomes 

<P> = -tft y J Wk(r) grad jp(r)dr J u k {r')Hr')dT' 

= -ih /JV'(r') [grad ^(r)]5(r - r ')drdr f . . 

’ — -ihJHr) gr&d Hr)dr y 

1 These results, while not rigorously established hero, are equivalent to the mathe¬ 
matical theorems on the expansibility of functions in Fourier series and Fourier 
integrals. 
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This is in agreement with the second of Eqs. (7.8). 

When 5-function normalization is used, the details of the calculation 
are very similar to those given above, except that the surface integral 
that results from the partial integration is over a sphere of infinite radius; 
it is zero because \p becomes vanishingly small at great distances. This is 
consistent with the supposition that \{/ is normalized; otherwise neither 
jP(k)dr k nor (p) have any physical meaning. The result of the calcula¬ 
tion in this case is the same as Eqs. (11.22) and (7.8). 

12. MOTION OF A FREE WAVE PACKET IN ONE DIMENSION 
The motion of a free particle (no external forces) in one dimension is 
described by the Schrodinger wave equation (6.8). The study of this 
motion provides an interesting application of the expansion techniques 
developed in Secs. 10 and 11. As a first step, we find the minimum value 
of the uncertainty product given in Eq. (8.1), and the possible forms of 
the one-dimensional wave packet that correspond to it, all at a definite 
instant of time. The structure of this minimum packet is the same 
whether or not the particle is free, since this form can be regarded simply 
as an initial condition on the solution of the Schrodinger equation for 
any V. However, the analytical work involved in finding at other 
times is especially simple in the force-free case. 

The Minimum Uncertainty Product. 1 In order to find the minimum 
value for the uncertainty produce Ax * Ap, we must first define what we 
mean by Ax and A p. While many expressions are possible, the simplest 
to handle analytically is the root-mean-square deviation from the mean, 
where the word "mean” implies the expectation value of Sec. 7. 

(Aa:) a = ((a: - <x» 2 > = {x 2 ) - (2x(x)) + {(x) 2 ) = (x 2 ) - (x ) 2 . . n 

(Ap) 2 = <(p - (p» 2 > = <p 2 > - <p) 2 1 

Here the equalities follow directly from the general definition of expecta¬ 
tion value given in Sec. 7. If now we put 

« s * - <*>» ^ p - <p> = ~ ih [m - <(£>] (12 - 2) 

then 

(Ax) 2 (Ap) 2 = J \ffa 2 \[/dx J $l3 2 \(/dx 

= ^{d$)(<xf)dx ^ (12.3) 

The transformation of the a integral in Eq. (12.3) is obvious; the similar 
transformation of the f3 integral follows from a partial integration when 
1 W. Heisenberg, “The Physical Principles of the Quantum Theory,” pp. 17-19 
(University of Chicago Press, Chicago, 1930). 
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we remember that ^ is a normalized wave packet, which vanishes at 


X = + oo . 

The inequality 



iJgdx 

7 


dx £ 0 


where all integrals are from x = — oo to + °o, is obviously true, and the 
equality is applicable only if / = yg, where 7 is a constant. From this 
inequality we obtain at once 

If now we replace/by aty and g by j£ty, Eq. (12.3) becomes 

(Aa;) 2 (Ap) 2 ^ \S(5$)(M)dx\ 2 = \W<xffldx\* (12.4) 


The last term in Eq. (12.4) can be written 

\}'$[i(otp - pa) + i(a(i + pa)]\f/(tx\* 

= iUH^P — Pot)\pdx[ 2 + i|/i }{ap + pa)^dx | 2 (12.5) 

The cross term in the product that is omitted on the right side of Eq. 
(12.5) can be seen to vanish when use is made of the relation 

Jifa fiyp <ix = (lx — J (p^)(a\j/) dx = f$pa\f/dx, 

which is obtained by using partial integration and remembering that a 
is real. Now from Eq. (12.2) 


(ap — pa)\p = ^ ih 

~ V dr 

1 

*§: 

11 

( 12 . 0 ) 

We thus obtain from Eqs. ( 12 . 1 ), 

(12.5), 

and ( 12 . 0 ) 


(A.r) s (AyO a ^ W 

or 

A.r ■ Ap ^ ih 

(12.7) 


where the equality can hold only if the second term on the right side of 
Eq. (12.5) is zero. This is the precise expression of the Heisenberg 
uncertainty relation (3.1), when the uncertainties Ax and A p are defined 
as in Eq. ( 12 . 1 ). 

Form of the Minimum Packet. It follows from the foregoing deriva¬ 
tion that the minimum uneorfuinty product is attained only when two 
conditions are fulfilled. 


vn[t = 7 fi\f/ 

J$(ap + f3a)ipdx = 0 


( 12 . 8 ) 

(12.9) 
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Equations (12.8) and (12.2) give us a differential equation for 4 1 

which is readily integrated to give 

t(x) = N exp |^ 2 ~^ (z - (*» 2 + (12.10) 

where IV is an arbitraiy constant. 

Equation (12.9), with the help of Eq. (12.8), becomes 

(^ + f) / MdX = ° 

which evidently requires that 7 be pure imaginary. Then since we want 
Eq. (12.10) to represent a wave packet for which the integral of |^| 2 con¬ 
verges, 7 must be negative imaginary. The magnitude of the constant N 
can now be fixed by normalizing \p. 


M 2 dx = 1 

Similarly, 7 can be determined by requiring that 
J(# — (x)Y\\p\ l dx = (Az) 2 

The integrals are readily evaluated, and lead to the normalized minimum 
wave packet 

*(*) = [2r(A*)*]-iexp [- (12.11) 

Momentum Expansion Coefficients. The one-dimensional momen¬ 
tum eigenfunctions analogous to Eqs. (11.4) and (11.11) are 


Ukix) ~ L-le ikx ( 12 . 12 ) 

for normalization in a one-dimensional “box” of length L, and 

u k (x) = (27t) (12.13) 


for 6-function normalization. Since for a free particle the wave equation 
has the simple form of Eq. (6.8) 


^ __ h 2 d V 

1 dt " ~ 2m to? 


(12.14) 


the momentum eigenfunctions are also eigenfunctions of the energy. 1 

1 The converse is not necessarily true, since there are two solutions of Eq. (12.16) 
for k (positive and negative) for each value of E k . 
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Thus any solution of the wave equation can be written in a form analogous 
to Eq. (10.18), 

\{/(x } t) = or J dk ^ A h e h u k (x), (12.15) 


where the A k are independent of x and the entire time dependence is 
contained in the exponential factor. Equation (12.15) is readily verified 
to be a solution of Eq. (12.14) by direct substitution, provided that 




2m 


(12.16) 


The problem of finding the motion of a wave packet is thus resolved 
into finding the expansion coefficient A k at some particular time, say 
t = 0 , and then using Eqs. (12.15) and (12.16) to find at other 

times. At t = 0, the exponential factors in Eq (12.15) are unity, and 
we may use the one-dimensional analogue of the second of Eqs. (11.17) 
to find A k . 

A k = 5u h (x)yp(xfi)dx (12.17) 

The limits on the integral are x = ±iL or x = ± <*>, according as box 
or 6 -function normalization is used. The momentum probability func- 

iEid 

tionP(/c) = | A k e h | 2 = {A^ is independent of the time, so that (p) and 
Ap, for example, are constants. 

Change with Time of a Minimum Packet. As a simple specific exam¬ 
ple, we take ^(;r, 0 ) to have the form of Eq. (12.11) with (x) = (p) = 0, 
so that the wave packet initially is centered at x = 0 and has zero average 
momentum. Then using box normalization, Eq. (12.17) gives 

A k = [2irJJ( A.r) s ]-1 exp j^- - iTrzj dx 

-[ 

where L is assumed to be so large that the contribution to the integral 
from |a:| > \L can be neglected. Substitution into Eq. (12.15) gives the 
wave function for general values (A l 

'KxJ) = (12.19) 


L 1 J 


(12.18) 


where k = 2 wn/L and n takes on all positive and negative integer values 
and zero. As in Sec. 11, L may be taken arbitrarily large, n can be 
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regarded as a continuous variable and the summation replaced by jdn, 
which in turn is the same as (L/2?r) $dk. Thus 


M) = [^r] f_ _“P [ dk 

- [~ W + W ] (12 - 2 °> 

The position probability density is then 


If Ml* 



(Az ) 2 + 


1W I}"* 

4?n 2 (Aa:) 2 _ J 


• exp 


z 2 \ 

2 

(A#) 2 I ll 

+ 4m 2 (Az) 2 JJ 


( 12 . 21 ) 


Equation (12.21) is of the same form as |^(a;,0)| 2 , except that (Az) 2 
is replaced by (Ar) 2 + hH 2 /Am 2 (Ax) 2 } which is equal to (Az) 2 + (Ap) 2 £ 2 /m 2 . 
Thus the center of the packet remains at x = 0 while the breadth of the 
packet increases as t departs from zero in both past and future directions. 
The smaller the initial uncertainty in position, the larger the uncertainty 
in momentum and the more rapidly the packet spreads; the time-depend¬ 
ent part of the above expression, t(Ap)/m , is simply the distance traveled 
by a classical particle of momentum A p in the time t. 

Use of the 5-function normalization does not alter the results of the 
foregoing calculation. The expression for A k given in Eq. (12.18) is 
to be multiplied by (L/27r)*;inEq. (12.19) the summation is to be replaced 
directly by fdk, thus eliminating a factor L/2w; finally, u k in Eq. (12.19) 
is to be multiplied by (L/2ir)K These three factors cancel, and so Eqs. 
(12.20) and (12.21) are unaffected by the choice of normalization of the 
momentum eigenfunctions. 

Classical Limit. We have seen in Sec. 7 that a wave packet always 
moves like a classical particle in so far as the expectation values of its 
position and momentum are concerned. However, classical dynamics is 
only useful as a description of the motion if the spreading of the wave 
packet can be neglected over times of interest in the particular problem. 

As a simple example of the kind of parameter that indicates when the 
classical limit is realized, we consider a wave packet that corresponds to a 
classical particle moving in a circular orbit of radius a and period T . We 
shall assume that this packet is sufficiently well localized so that the 
potential energy does not vary appreciably over its dimensions. Then 
the classical theory can provide a useful description of the motion only 
if a wave packet like that discussed above spreads by an amount that is 
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small in comparison with a during a time that is large in comparison 
with T. The smallest spread of a packet during a time interval of mag¬ 
nitude t is attained when Ax is chosen to be of order ( ht/m)K We thus 
require that (ht/m) 1 <3C a when T . This condition may be simply 
expressed by saying that the angular momentum 2wma 2 /T of the particle 
must be very large in comparison with h. Thus for most atomic systems, 
Avhere the angular momentum of each electron is of order h , a wave packet 
corresponding to a well-localized particle spreads so much in one period 
that this type of description of the motion is not of physical interest. 


Problems 

1. Given three degenerate eigenfunctions that are linearly independent although 
not necessarily orthogonal. Find three linear combinations of them that are orthog¬ 
onal to each other and arc normalised. Are the three new combinations eigen¬ 
functions? If so, are they degenerate? 

2. Show that so far as the one-dimensional motion of a particle is concerned, the 
functions u x '(x) — 5(x — x') for all real x' constitute a complete orthonormal set, 
and that each of them is an eigenfunction of the position variable x with the eigen¬ 
value x\ Set up the position probability function and compare with that obtained 
in Sec. 7. 

3 . If the potential energy V{x) in a one-dimensional problem is a monotonie 

increasing function oT x and independent of the time, show that the functions 
uv’(x) = (dV/dx)^*^ — x') for all real x where V' = F(;r'), constitute a com¬ 

plete orthononnal set of eigenfunctions of the potential energy with eigenvalues V'. 
Find the probability function for the potential energy, and show that it has the prop¬ 
erties that would he expected of it. 

4. What changes are needed in the discussion of the momentum eigenfunctions 
given in Sec. 11 if normalization is curried through in a box of rectangular parallel¬ 
epiped shape rather than in a box of cubic,al shape? 

6. Find two other representations for the Dirac 5 function like that given in Ftp 

( 11 . 9 ). 

6. Verify each or Fqs. (11.13) involving 8 functions. 

7. Show that the two Fqs. (11.20) arc correct: that the momentum probability 
function defined in Fqs. (11.10) and (11.17) for a normalized ^ sums or integrates to 
unity. 

8. The expression in square brackets in the integrand of Fq. (10.10) enables one to 
calculate ^ at time t in terms of at Lime t n . If this expression is called G fax',1,1*) in 
the one-dimensional ease, then yp(x,l) = {G(x,x'jL 0 )dx' . Show tin t for a free 
particle in one dimension 


m iit\ \ ~i m T f im(x — 

G(x,x ,t,L») = [ 2)rA ({ _ W 'J cxp [ 2 h\l - W J 


Assume that ^ 1ms the form of the normalized minimum wave packet (12.11) at Jo = 0; 
use the above result to find \J/ and \\p\ z at another time l. 
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The formalism that was developed in Chap. II and elaborated in Chap. 
Ill will now be applied to the explicit computation of discrete energy 
levels and the corresponding eigenfunctions. The next chapter will 
take up situations in which the energy eigenvalues are continuously 
distributed. Thus we are concerned here with bound states in which the 
particle is restrained by the external forces (potential energy) to a 
particular region of space, and in the next chapter with collision problems 
in which the particle can approach from and recede to infinite distance. 

The relatively few potential energy functions 7(r) for which analytic 
solutions of the wave equation (8.2) are possible, are important beyond 
these immediate problems, since they often serve as bases for approxi¬ 
mate calculations on more complicated systems. 

13. LINEAR HARMONIC OSCILLATOR 

The one-dimensional motion of a point mass attracted to a fixed 
center by a force that is proportional to the displacement from that center, 
provides one of the fundamental problems of classical dynamics. Its 
study is important not only for itself, but also because more complicated 
systems can often be analyzed in terms of normal modes of motion that 
are formally equivalent to harmonic oscillators. The linear harmonic 
oscillator in quantum mechanics is similarly of importance both for the 
study of such problems as the vibrations of individual atoms in a mole¬ 
cule, and for the analysis of more complicated systems such as crystals 
and (as we shall see in Chap. XIII) quantized wave fields. 

Asymptotic Behavior. The force F = — Kx can be represented by 
the potential energy V(x) = %Kx 2 so that Eq. (8.5) becomes 

It is convenient in dealing with an equation of this type to rewrite it in 
dimensionless form. To this end we introduce a dimensionless inde¬ 
pendent variable £ = ax } and a dimensionless eigenvalue X, and attempt 
to put Eq. (13.1) in the form 
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Comparison of Eqs. (13.1) and (13.2) shows that this is possible if 



where = ( K/m )* is the angular frequency of the corresponding classical 
harmonic oscillator. 

The solution of Eq. (13.2) is facilitated by first examining the domi¬ 
nant behavior of u in the asymptotic region £ —> ± °o . 1 For sufficiently 
large £ it is apparent that w(£) = £ n e±*P satisfies Eq, (13.2) so far as the 
leading terms (which are of order £ 2 u) are concerned, when n has any 
finite value. The boundary conditions of Sec. 8 permit us to keep only 
the minus sign in the exponent. This suggests that it might be possible to 
find an exact solution of Eq. (13.2) of the form 

*(£) = (13.4) 

where i?(£) is a polynomial of finite order in £. Substitution of Eq. 
(13.4) into Eq. (13.2) gives as the equation for //(£) 

H" - 2 £fP + (X - 1 )H = 0 (13.5) 

where primes denote differentiation with respect to £. 

Energy Levels. We now find a solution for H in the form 

= ?(ao + ait + atf +•••), «o ^ 0, s^O (13.6) 

This is necessarily finite for £ = 0 . Equation (13.5) is to be valid for all 
values of £, so that when Eq. (13.6) is substituted into it, the coefficient 
of each power of £ can be equated to zero. 

s(s — l)a 0 = 0 
(s + l)sai = 0 

(s + 2)(s+ 1 )a 2 - ( 2 s + 1 - \)a u = 0 (13.7) 

(s 3)(s -f- 2)<x a — (2s + 3 — X)ui = 0 


(s v -|- 2 )(s -|- v ~f- l)a^ + 2 — ( 2 s -|- 2v + 1 — A)a„ = 0 

where v is an integer. Since a Q cannot be zero, the first of Eqs. (13.7) 
tells us that s = 0 or s = 1. The second equation tells us that s = 0 , 
or ai — 0, or both. Then the third equation gives us a 2 in terms of a 0 , 
the fourth gives us a 3 in terms of ai, and the general equation gives us 
a P+2 in terms of a v . 

It follows from Eqs. (13.7) that the presence in the series (13.6) of a 
finite or an infinite number of terms depends on the choice of s, ai, and 
1 We follow the polynomial method of A. Sommorfeld, “Wave Mechanics,” p. 11 
(Dutton, New York, 1929). 
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the eigenvalue X. If the series does not terminate, its dominant asymp¬ 
totic behavior can be inferred from the coefficients of its high terms 

flp+2 _ 2 

a p * v 

This ratio is the same as that of the series for % n eP with any finite value of 
ft. Equation (13.4) shows that this behavior for H violates the boundaiy 
conditions on u for large |£|. 

Thus the series (13.6) must terminate. This means that 
X = 2s -f- 2v + 1 ) 

v must be an even integer, since a 0 0 and otherwise the even-subscript 
terms would form an infinite series. Since the odd-subscript series can¬ 
not then terminate, we must choose <ii = 0. The index s can still be 
either 0 or 1, and corresponding to these two values X is equal to 2v + 1 
or 2v + 3, where v is an even integer. We may express both cases in 
terms of a quantum number n 

X = 271 -f- 1, E n = (n -f- -^)hoi C) n = 0, 1, 2, . . . (13.8) 

Zero-point Energy. The infinite sequence of energy levels (13.8) has 
the equal spacing postulated in 1900 by Planck, which is in agreement 
with the quantization rules of the old quantum theory. However, the 
finite value of the ground-state energy level iAw c , which is called the 
zero-point energy , is characteristic of the quantum mechanics, and is 
related to the uncertainty principle in the same manner as is the finite 
lowest energy level for the square well with perfectly rigid walls (Sec. 9). 
The total energy is of order ( Ap) 2 /m + K(Ax) 2 , where A p and Ax are 
measures of the spreads in momentum and position, as defined in Sec. 12; 
if this is minimized, taking account of the uncertainty relation (3.1), 
it is easily seen that the minimum A p is of order ( Kmh 2 )*, so that the 
minimum total energy is of order h(K/m -)* or hw e . 

Parity. It follows from Eqs. (13.8) and (13.7) that n is the highest 
value of s + v in the series (13.6) for H. If we denote the corresponding 
polynomial by # n (£), we see that H n is of degree n in £, and is wholly 
even or odd according as n is even or odd. Since e ~is even and has no 
nodes, the corresponding eigenfunction 7* n (£) has the parity of n, and has 
n nodes. These conclusions are in agreement with the earlier results of 
Secs. 8 and 9. 

Hermite Polynomials. The polynomial of order n that has the parity 
of n and is a solution of Eq. (13.5) with X = 272. + 1 

#« - 2£tf' n + 2 nH n = 0, 


(13.9) 
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is called the nth Hermite polynomial #«(£). It is clear from the fore¬ 
going solution of Eq. (13.5) that these conditions define H n uniquely 
except for an arbitrary multiplying constant. It is not necessary, then, 
to use the recursion relations (13.7) to study the detailed properties of 
the H n , if some other formulation of them can be found that is consistent 
with these conditions. A far more convenient formulation is actually 
available, which expresses the H n in terms of a generating function $(£,s). 


S(£,k) = 


-2 

n = 0 


nl 


s n 


(13.10) 


If the exponential in Eq. (13.10) is expanded out in powers of s and £, it is 
seen that a given power of s is associated only with powers of £ that are 
equal to that power or less than it by an even integer. Thus H„(£) 
defined in this way is a polynomial of order n that has the parity of n. 

To show that this II n satisfies the differential equation (13.9), we 
differentiate both sides of Eq. (13.10) first with respect to £ and then with 
respect to s. 

||. - £ Z" "■<«> - X si "'■<» 

I?-.. .. 


Equating equal powers of .s* in the sums of these 1 , two equations gives 
respectively 


If n = 2 fill 

II n i-i — 2£// n — 2 nTf n -.i 


(13.11) 


The lowest order differential etpiation involving only II n that can be 
constructed from Eqs. (13.11) is easily seen to be Eq. (13.0). Thus the 
//„(£) given by 10(i. (13.10) are the Hermite polynomials. 

The relations (13.1 I) may be used for the calculation, of the II n and 
their derivatives, or an explicit expression obtainable directly from the 
generating function may be used. If #(£,«) is differentiated n times with 
respect to .s* and .s* is then set equal to 0, Eq. (13.10) shows that the result 
is simply // n (£). Now for any function of the form/(.s* — £) it is apparent 
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Thus 

— = e P — = (_ i)« c P — 

aa» a«» 1 J a? n 

This gives an expression for the nth Hermite polynomial 


(13.12) 


The first three polynomials calculated from Eq. (13.12) are 
#„(£) = 1, HM) = 2f, ff 2 (£) = 4 £ 2 — 2 

Harmonic-oscillator Wave Functions. The generating function is 
also useful for the calculation of integrals involving the harmonic-oscil¬ 
lator wave functions (13.4) 

u n {x) = N n H n (ca)e-i*'*' (13.13) 

Suppose, for example, that we wish to normalize u n (x ); this is equivalent 
to choosing the constant N n such that 

f \un{x)\*dx = ^ [° HM)e-edt - 1 

The integral on the right can be expressed as a series coefficient in the 
expansion of an integral containing the product of two generating 
functions. 


r™*™**** - X D S (1314) 

n0 771 ™ 0 

The integral on the left of Eq. (13.14) is readily evaluated directly to give 


Tr^e 2tt — 7r* 



(13.15) 


If equal powers of s and t are equated in the series on the right sides of Eqs. 
(13.14) and (13.15), we obtain the results 


f’ m H*U)er*di = x* 2 "nl, 

J* m H n (£)H m (£)e-PdZ = 0 , n 9* m 


(13.16) 


The first of Eqs. (13.16) tells us that the normalizing constant can be 
chosen to be 
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where a constant multiplicative complex phase factor of unit magnitude 
is still arbitrary. The second of these equations tells us that u n (x) and 
u m (x) are orthogonal to each other if n ^ m\ this is in agreement with the 
general result obtained in Sec. 10 for nondegenerate energy eigenfunc¬ 
tions, since in accordance with Eq, (13.8), E n ^ E m if n rn , and so 
there is no degeneracy. 

The integral 

[ u n (x)xu n (x)dx = r %H n (£)H m (g)crPdZ 

J — 00 Oi J _ 00 

is typical of others that can be evaluated with the help of the generating 
function. The two series expressions for the integral, 




**(a + t)e 2st 


00 

-l- 


(s w+1 / n + tS* w / n+l ) 

n\ 


may be equated term by term. With the help of Eq. (13.17), we get 


/ I ( n + 1 \ 
r « \ a V 2 / 

/ u n (x)xu w (x)dx = < 1 (n\ h 

J ~ “ ) cl \ y' 


m = n + 1 


) m 


= n-l (h3.18) 


otherwise 


Correspondence with Classical Theory. Plots of the first six har¬ 
monic oscillator wave functions arc shown in Fig. 10. It is apparent 
that the position probability densities |w n | 2 associated with these station¬ 
ary wave functions have little resemblance to the corresponding densities 
for the classical harmonic oscillator; the latter are proportional to 
(£o ~ £ 2 ) -1 , where £ 0 is the amplitude of the classical oscillator whose 
energy is equal to the quantum-mechanical eigenvalue. The agree¬ 
ment between classical and quantum probability densities improves 
rapidly with increasing n. Figure 11 contains a plot of |it n | 2 for n = 10 
(solid curve), and of the density of a classical oscillator of total energy 
2£hu> 0 (dashed curve). The agreement is quite good on the average, the 
principal discrepancy being the rapid oscillations in l^nl 2 . 



*5 -4 -3 -2 “1 0 I 2 3 4 5 -5-4-3 -2-1012 3 4 5 -5-4 -3 -2 -1 0 I 2 3 4 5 

& U i 

Fig. 10. Energy eigenfunctions for the first six states of the harmonic' oscillator. [ After 
L . Pauling andE. B. Wilson, Jr., “Introduction to Quantum Mechanics ?> 7 >. 7A 75 ( McGraw- 
Hill , tfew For A;, 1935).! 



Fig. 11. Position probability density for the state n — 10 of a harmonic oscillator (solid 
curve) and for a classical oscillator of the same total energy (dashed curve). ( After L. 
Pa/lding and E. B. Wilson, Jr., op. cit., p. 76.) 
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The expectation value for the potential energy can be obtained from 
Eq. (7.6): 

(V) n - f"* U n (x)iKx 2 U n (x)dx 

~ iK ^— ^r ) = + i)hu 0 = 

where $x 2 \u n \ 2 dx can be calculated with the help of the generating func¬ 
tion in analogy with the evaluation of Eq. (13.18). Thus for any value 
of n } the average potential and kinetic energies are each half of the total 
energy, just as is the case with the classical oscillator. 

In similar fashion it can be shown that {x) = (p) = 0 for any 
harmonic-oscillator wave function, so that Eq. (12.1) tells us that 
(A#) 2 = (x 2 ), and (Ap) 2 = (p 2 ). It is then easy to see that the uncer¬ 
tainty product is 

Ax ■ Ap = (n + i)h 

This has the minimum possible value ih of Eq. (12.7) for the ground- 
state eigenfunction 

u a (x) = (13.19) 

7T* 

which, as would be expected, is of the form of the minimum packet 
(12.11). Thus the minimum packet happens to be an eigenfunction of 
the harmonic-oscillator wave equation if its Ax is properly related to IC 
and m. 

Oscillating Wave Packet. In accordance with Eq. (10.18), the 
general solution of the time-dependent Schrodinger equation for the 
harmonic oscillator 

ih | = (- ^ ~ + 

can be expanded in terms of stationary wave functions 

oo iPJ H t “ 

^ A n K n (x)c k = « _i ‘ w ^ A n u n {x)<r i,lu "‘ (13.20) 

7i = 0 n = 0 

where the A n arc arbitrary constants. Thus apart from the phase 
factor i£(x,Z) is a periodic function of t with the period of the classical 

oscillator 2tt/o3 0 . This suggests that, it might be possible to find a solution 
in the form of a wave packet whose center of gravity oscillates with the 
period of the classical motion. 

To investigate this possibility, we assrime that at t = 0 the ^ of 
Eq. (13.20) has the form of the normalized minimum packet (13.19), 



68 


QUANTUM MECHANICS [Chap. IV 


except that the center of gravity is displaoed in the positive x direction 
by an amount a. 

00 

*(*,0) = V A n u n (x) = Sj > a (13.21) 

n p0 

We can make use of the orthonormality of the u n to calculate a particular 
coefficient A m by multiplying Eq. (13.21) through by u m (x) and integrat¬ 
ing over x . 

A n = «.(*)*(*,0)<fe = HM)e' ip er ^ aa 

The integral on the right can be evaluated with the help of the generating 
function by equating term by term the two series expressions for the 
integral 



^*+2*f e -<s=r 





•trig -I &*+»{• 


Tr*e-W 


to 

1 


n *0 


W 

nl 


On making use of Eq. (13.17), we obtain 


4 = 

(2-n!)» 

Substitution of these A„ into Eq. (13.20) gives 


(13.22) 


\p(x,t) = e—if*—(if oe -w )n 

71 — 0 

= exp (- ia 2 - + Ho«r**0 

= ^ exp [ - |(f — f 0 cos w c <) 2 

T 2 

— i(-$W + ££o sin w c t — sin 2co c i)] 

where the sum is evaluated with the help of the generating function 
(13.10). The absolute square of this wave function gives a position 
probability density 


\\p(x,t)\ 2 = —r 
7r* 
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This shows that $ represents a wave packet that oscillates without 
change of shape about x = 0, with amplitude a and the classical frequency. 

As a 0, \p approaches the lowest energy eigenfunction uo(x)er l™ et . 
The larger a becomes, the larger the number of stationary states that 
contribute significantly to the packet, and the larger the quantum num¬ 
ber n 0 for which A n of Eq. (13.22) has a maximum. For n 1, we can 
use Stirling’s formula to maximize In A n ; neglecting terms of order In n 
and lower 

In A n ^ n(ln £ 0 — £ In 2) — in(ln n — 1) 

n 112 _ (13.23) 

no = ^° _ 2^r 0 

Thus the energy level E no = (n 0 + \ )ftco fl , from whose neighborhood most 
of the contribution to \p comes, is approximately equal to the energy 
iKa? of the classical oscillator that has the same amplitude. 


14. SPHERICALLY SYMMETRIC POTENTIALS IN THREE DIMENSIONS 


It is generally impossible to obtain analytic solutions of the three- 
dimensional wave equation (8.2) unless it can be separated into total 
differential equations in each of the three 
space coordinates. It has been shown 1 
that there are 11 coordinates systems 
in which the free-particle wave equa¬ 
tion [Eq. (8.2) with 7 = 0] can be 
separated. One of the most important 
of these is the spherical polar-coordinate 
system, in terms of which the rectangular 
coordinates are given by (see Fig. 12) 


x = r sin 6 cos <£ 
y = r sin 6 sin <£ 
z = r cos 6 



Fig. 12. llolation between roc- 
tangulur and spherical polar coordi¬ 
nates of a point P. 


If the potential energy is spherically symmetric, so that 7(r) = 7(r) is a 
function only of the magnitude r of r measured from some origin, the wave 
equation can always be separated in spherical coordinates. Many prob¬ 
lems of physical interest can be represented exactly or approximately 
in terms of spherically symmetric potentials of various shapes. In 
this section we effect the separation, and solve the resulting total dif- 
1 L. P. Eiscnhart, PhijR. Rev., 46, 428 (1934). See also L. Pauling and K. B. Wilson, 
Jr., “Introduction to Quantum Mechanics,” Appendix IV (McGraw-Hill, Now York, 
1936). 
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ferential equations in 0 and <£. The next two sections deal with the 
solution of the radial equation for particular forms of 7(r). 

Separation of the Wave Equation. The wave equation (8.2) with a 
spherically symmetric potential energy may be written in spherical 
coordinates 


h 2 \ 1 9 ( * d \ \ 1 d ( • * d \ 1 d 2 1 

2m |_r 2 dr \ dr) + r 2 sin 6 dd V™ d dd) + r 2 sin 2 6 d<j>' 2 ] U 

+ V(r)u = Eu (14.1) 

We first separate the radial and the angular parts by substituting 

v(r,Q,<l>) = R(r)Y(0,4>) 
into Eq. (14.1) and dividing through by u . 

“ “ f [sKl ft (“" * w) + EH jf] < u2 > 

Since the left side of Eq. (14.2) depends only on r , and the right side 
depends only on 6 and <£, both sides must be equal to a constant that we 
call X. Thus Eq. (14.2) gives us a radial equation 

+ < 143 > 


and an angular equation 
1 d 


sin 6 dd 


( sine ^) 


+ 


d*Y 


sin 2 6 d<\>- 


+ \Y = 0 


04.4) 


The angular equation (14.4) can be further separated by substituting 
Y(6,<j>) = 0(0)4>($) into it and following the same procedure to obtain 


d 2 $ 


+ V$ = 0 


1 d 
sin 6 dd 


( Si ""sl) + ( x_ asb ) e " 0 


(14.5) 

(14.0) 


The 4> equation (14.5) can be solved at once; its general solution may be 
written 


<*>(<#>) = Ae^* + Be-* 1 **, 
$(<#>) = A + B<t >, 


v ^ 0 
v = 0 


(14.7) 
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The requirement of Sec. 8 that $(<f>) and d$/d<$> be continuous through¬ 
out the domain 0 to 2r of <f> demands that v be chosen equal to the square 
of an integer. We thus replace Eqs. (14.7) by 

= (2r)(14.8) 

where now all physical meaningful solutions are included if m is allowed 
to be a positive or negative integer or zero; 1 the multiplying constant is 
chosen equal to (27 t)”& in order that $ be normalized to unity over the 
range of <f>. 

Legendre Polynomials. Unless V ( r ) is specified, the farthest we can 
carry our treatment is the solution of the 0 equation (14.6), where now 
v = m 2 . It is convenient to substitute w — cos Q for 0, and put 

0(0) = P(w), 

when Eq. (14.G) becomes 

Since the domain of 0 is 0 to it, the domain of w is 1 to — i. The procedure 
for solving Eq. (14.9) is in many respects similar to the solution of the 
harmonic-oscillator wave equation presented in Sec. 13, and will not be 
given in detail here.’ 2 Since Eq. (14.9) is a second-order differential 
equation, it has two linearly independent solutions. Except for particu¬ 
lar values of X, both of these are infinite at w — ± 1, and in accordance 
with Sec. 8 are not physically acceptable. If however X = 1(1 + 1), 
where l is a positive integer or zero, one of the solutions is finite at w = ± 1 
(the other is not); this finite solution has the form (1 — w 2 )lM times a 
polynomial of order l — |m| in w , and has the parity of l — \m\. 

The physically acceptable solutions of Eq. (14.9) when m = 0 are 
called the Legendre 'polynomials Pi(w). Just as is the case with the Her- 
mite polynomials, their properties may be discussed in terms of a generat¬ 
ing function 

T(w,s) = (1 - 2.s w + **)-i 
00 

= V I\(wy, s < 1. (14.10) 

l-l0 

1 At the very slight risk of confusion with the mass of the particle, we make use of 
the customary symbol m for the quantum number associated with the coordinate <j>. 

2 For a complete discussion of this equation, sec E. T. Whittaker and G. N. Watson, 
“A Course of Modern Analysis,” 4th ed., Chap. XV (Cambridge, London, 1935). 
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Differentiation of the generating function with respect to w and s leads to 
relations that are analogous to Eqs. (13.11) for the Hermite polynomials. 

(l - w^Pi - - IwPi + ZPi_i ru in 

(I + l)Pi+i = (2 1 + 1 )wP t - iP w U ' 

where primes denote differentiation with respect to w. The lowest order 
differential equation involving only Pi that can be constructed from Eqs. 
(14.11) is easily seen to be Eq. (14.9) with X = 1(1 + 1) and m = 0. 

For m not necessarily equal to zero, Eq. (14.9) has physically accept¬ 
able solutions if X = 1(1 + 1) and \m\ g 1 These solutions, which are 
called associated Legendre functions , are expressible in terms of the 
Legendre polynomials 

d\m\ 

P?( W ) - (1 _ «*)*M V«(w) (14.12) 

This can be shown by substitution of Eq. (14.12) into the equation that is 
obtained by differentiating |m| times the equation for Pi(w). The 
generating function for the associated Legendre functions is obtained by 
differentiating Eq. (14.10) \m\ times with respect to w and multiplying 
by (1 — t 0 2 )*H. 

m _ (2|m|)!(l - w 2 )iW s W 

; 2W(|m|) !(1 - 2sw + s 2 )W+l 

cn 

= y Pr(w)s l (14.13) 

Spherical Harmonics. The angular part Yi m {6,<j>) of the complete 
wave function, which is a solution of Eq. (14.4) when X = 1(1 + 3), is 
called a spherical harmonic. It is apparent that 

Yi m (d,<i>) = NimPTte os *)$,(*) (14.14) 

where $ m (<#>) is given by Eq. (14.8), and Ni m is the normalization constant 
for the associated Legendre function. 

The same proof that was given in Sec. 10 for the orthogonality of the 
energy eigenfunctions may be used to show that solutions of Eq. (14.4) 
corresponding to different eigenvalues X or l are orthogonal. The 
eigenvalue l is, however, (21 + l)-fold degenerate, since there exist 
linearly independent solutions Yi m (d,<l>) for this value of l and all integer 
values of m between +1 and — l. The choice of Eq. (14.8) for <h„,(4>) 
makes these degenerate eigenfunctions orthogonal. We have then that 
the integral 

Jo r sin 8d6d<j) = Y i m Y Vm 4wd<i> 
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vanishes unless l — V and m = m'. It is interesting to note that there is 
no more orthogonality present than is necessary to make this integral 
vanish when it should. Thus the part of the integral vanishes when 
m 7 ^ m' without regard for the l values; the 0 or w part of the integral 
vanishes only when l ^ V and \m\ = |m'|, since for in ^ m' the ortho¬ 
gonality is taken care of by the integration over 0 . 

The integral 

fl x P?(w)P™(w)dw (14.15) 

can be evaluated in various ways, for example, by using the generating 
function (14.13) in a manner similar to that described in Sec. 13. As 
expected, the integral (14.15) vanishes unless l = 2', when it has the 
value [2/(21 + 1)][(2 + \m\)\/(l — |m|)!]; thus Ni m , which contains an 
arbitrary complex phase factor of unit magnitude, may be taken to be 
the reciprocal of the square root of this quantity. The normalized 
spherical harmonics are then 

YUe,4>) = [ ? —- 1~ j^jjj ‘iYCcos (14.16) 

The first four spherical harmonics are 

r - (s) 1 " in »' J " 

- (iy ““ ** 

Parity. The concept of parity introduced in Sec. 9 can now be 
extended to three-dimensional problems of the type discussed in this 
section. Suppose that the position coordinate r is reflected through the 
origin so that r is replaced by — r*, this corresponds to replacing x by — .r, 
y by — y, and z by — z , or to replacing 0 by tt — 0, <$> by (j> + tt, and leaving 
r unchanged. It is clear that the only change in the wave equation 
(14.1) is that u(r y 6,<t>) is replaced by u(r } ir — 0,<£ + t), the rest of the 
equation being unaffected. Then the discussion of Sec. 9 shows that 
orthogonal linear combinations of degenerate eigenfunctions can be 
found that have definite parities, and that a nondegenerate eigenfunction 
must have a definite parity. 

The energy levels for a spherically symmetric potential are degenerate 
at least with respect to the quantum number in, for l > 0 . In this case, 
the degenerate eigenfunctions all have the same parity, which we now 
show to be the parity of 2. When r is reflected through the origin, the 
radial part li(r) of the solution is unchanged, the </> part #( 4 >) given by 
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Eq. (14.8) has the parity of |m|, and the 0 part Pf (cos 0) has the parity 
of l — \m\, since P?(w) is equal to an even part (1 — w 2 )* ,ml times a poly¬ 
nomial in w that has the parity of l — |m|with respect to change in sign 
of w or cos 0. Thus and hence u(r), has the parity of l. 

Angular Momentum. The radial wave equation (14.3) may be 
rewritten in a form that resembles the one-dimensional wave equation 
(8.5). If we put R(r) = x(r)/r, the equation for the modified radial 
wave function x may be written 


Thus the radial motion is similar to the one-dimensional motion of a 
particle in a potential 


V(r) + 


1(1 + l)ft 2 
2 mr 2 


(14.18) 


The additional “potential energy” can be seen physically to be connected 
with the angular momentum in the following way. A classical particle 
that has angular momentum M about the axis through the origin per¬ 
pendicular to the plane of its orbit has the angular velocity co = M/mr 2 
when its radial distance from the origin is r. An inward force 


is required to keep the particle in this path; this “centripetal force” is 
supplied by the potential energy, and hence adds to the V(r) that appears 
in the radial motion an additional “centrifugal potential energy” 
Af 2 /2mr 2 . This has exactly the form of the extra term in (14.18) if we put 

M = [1(1 + 1)]W 

The foregoing physical argument for identifying the quantum number 
l with the angular momentum of the particle can be put in quantitative 
form by finding the operators that correspond to the three components 
of the angular momentum vector. Classically, we have that M = r X p, 
so that we take in quantum mechanics 

M x = yp z - zp v = -ih(y±-z-^ 

My = zp x - xp t = — ih x (14.19) 

M, = x Vu - yp x = - ih (x±-y£j 

Equations (14.19) can be transformed into spherical polar coordinates to 
give 
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M. 

M, 


' x = ih ^si 
„ = ih (^— 


sin <t> ^ + cot 8 cos c#> 
00 


cos <£ — + cot 0 sin 
do 


£) 




(14.20) 


The operator that represents the square of the total angular momentum is 
then found from fiqs. (14.20) to be 


M* = Ml + M\ + Ml 

1 d (-■ « a \ 
sin 8 i)6 \ Sin 0 dd) 


= -ft 2 


+ 


1 


sin- 6 d(j) 


d 2 1 

W 1 ] 


(14.21) 


Comparison of Eqs. (14.21) and (14.4) shows that Ffc ft (0,(f>) is an eigen¬ 
function of M- wit.lv the eigenvalue 1(1 + l)h 2 . 


M *YU0,4>) = Kl + 1 WY lm (6,<l>) (14.22) 

In similar fashion, it follows from the structure of Eq. (14.8) and the last 
of Eqs. (14.20) that <I> m (4>), and hence also Yi„X0,<l>), is an eigenfunction of 
M a with the eigenvalue mh. 

M:Yi m (Q,<t>) = mhYUQrf) (14.23) 

Thus the separation of the wave equation in spherical polar coordi¬ 
nates results in wave functions that are eigenfunctions ol both the total 
angular momentum and the component of angular momentum along 
the polar axis. The quantum number l that appears in Eq. (14.22) is 
called the a zimulhol or orbital (imjular-mwnmLum, quantum number. The 
quantum number m that appears in Eq. (14.23) is called the magnetic 
quantum number t since it is of importance in (lie theory of the Zeeman 
effect (see Bee. 30), which involves the component, of angular momentum 
along the magnetic field (z axis). It should be noted that the wave 
equation cannot in general be separated in this way and angular-momen¬ 
tum eigenfunctions obtained if the potential energy V(r) is not spherically 
symmetric. This corresponds to the classical result that the angular 
momentum is a constant, of the motion only lor a central field of force 
(which is describable by a spherically symmetric potential). There is, 
however, the characteristic difference between classical and quantum 
theory that all three components of M can be precisely specified at once 
in the classical theory, whereas only M z and M“ can in general be pre¬ 
cisely specified at once in the quantum theory, since Yi vl (0,<l)) is not an 
eigenfunction of M x and M y (except for the case l = 0). This result can 
be related to the uncertainty principle (see Prob. 5, ('hap. I). The choice 
of the direction of the polar axis that distinguishes M z from M x and M v 
is, of course, completely arbitrary; it corresponds to the arbitrariness of 
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the direction of space quantization in the absence of external fields in 
the old quantum theory. 


16. THREE-DIMENSIONAL SQUARE WELL POTENTIAL 

We are now in a position to find the bound-state energy levels that 

correspond to particular choices of the 
potential energy 7(r) and of the angular- 
momentum quantum number Z, by solving 
the radial wave equation (14.3). As a first 
example, we consider the square well poten¬ 
tial of finite depth, for which V(r ) = — Fo, 
r < a, V(r) = 0, r > a, where Vo is positive 
(see Fig. 13). A spherical region of this 
type in which the potential is less than that 
of the surroundings serves to attract a par¬ 
ticle just as in the one-dimensional case 
considered in Sec. 9. 

Zero Angular Momentum. When Z = 
0, it is easier to solve the wave equation in the form (14.17) than in the 
form (14.3). In this case, 12 (r) = xM/ 7 ’, and the wave equation is 



Pig. 13, Spherically symmet¬ 
ric square well potential of 
depth Vq and radius a. 


- s & - 

“ SS $3 ■ r>a 


(15.1) 


The solution of Eqs. (15.1) is the same as that obtained for the finite 
potential step in Sec. 9, except for three points: first, the energy scale is 
lowered everywhere in the present problem by an amount Vo ; second, the 
domain of r is 0 to + <x>, in place of the domain — oo to + co for x; and 
third, the boundary condition that the wave function not become infinite 
at x = — oo is now replaced by the same condition at r = 0. 

From the discussion of Sec. 9, it is apparent that the solutions of 
Eqs. (15.1) are 


x(r) = A sin or + B cos or, 
x(r) = Ce-er, 



r < a, 

* - m 

r > a 


(15.2) 


where we are interested in bound-state energy levels for which E < 0. 
The requirement that R{t) be finite at r = 0 demands that we set B = 0 
in the first of Eqs. (15.2). Thus the solution has the form of the odd 
parity solution of the one-dimensional problem. The energy levels are 
obtained by equating the two values of (1 /x){dx/dr) at r = a (this is 
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equivalent to making (1 /R){dR/dr) continuous there), and are given by 
solving 

a cot aa — —18 (15,3) 

which is the same as Eq. (9.6). Then it follows from the discussion of 
Fig. 9 that there is no energy level unless Voa 2 > 7 r 3 A 2 / 8 m; there is one 
bound state if ir 2 h 2 /Sm < VqCl 2 ^ 97 r 2 ft 2 / 8 m, etc. 

Interior Solutions for Arbitrary L For nonzero values of l , it is more 
convenient to work with the original radial equation (14.3) than with the 
equation for x- If we put p = cer, where a is defined in Eq. (15.2), the 
wave equation for r < a becomes 


, 2dR , 
+ ~pT p + 


i(l + 1 ) 




(15.4) 


The strong resemblance between Eq. (15.4) and Bessel's equation sug¬ 
gests that R(r) can be expressed in terms of Bessel functions. This is in 
fact the case; if we define the “spherical Bessel function'' ji(p) that is 
regular at p = 0 by 1 


-&)'■ 


(15.5) 


where J is an ordinary Bessel function of half-odd-integer order, it is 
easily verified that jz(p) satisfies Eq. (15.4). The definition (15.5) also 
applies (except for sign) to the relation between the “spherical Neumann 
function" ni{p) that is singular at p = 0 and the corresponding singular 
(negative-order) solution of Bessel’s equation. 

It can l)e shown 2 that ./z-h(p), where l is a positive or negative integer 
or zero, is expressible as a sum of products of sin p and cos p with poly¬ 
nomials of odd order in p“ *. In particular, explicit expressions for the 
first three f s and n’s are 


(15.(5) 



«o(p) = 

COS P 

p 

COS p 

p 

ni(p) = 

COS p 
•) 

p 

V Hin p 



3 

-- cos P, 

0* 

>h(p) = 

- (A _ 

\P 5 


3 . 

? smp 


The leading terms for small p are 3 


1 This definition and the properties of the ji and m are taken from P. M. Morse, 
“Vibration and Sound,” 2d od., pp. 316-317 (McGraw-Hill, New York, 1948). 

a G. N. Watson, “Theory of Bessel Functions,” 2d cd., p. 62 (Macmillan, New 
York, 1946). 

3 Equations (16.7) are useful approximations for p 2 somewhat less than 4^+6 and 
2, respectively (G. N. Watson, op. cit., p. 44). 
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(15.7) 


and the leading terms in the asymptotic expansions are 1 
ji(p) -* - cos [p - h(l + 1)tt] 

P—> 00 P 

ni(p )-> - sin [p — + l)ir] 

P-V QO p 


(15.8) 


Some properties of the j’s and n’s are 

fjo(p)p 2 dp = ip 3 D'o(p) + Wo(p)ji(p)] 
fn 2 0 (p)p 2 dp = ip s [n§(p) - j 0 (p)tti(p)] 

ni-i(p)ji(p) — Mp')Ji-i(p) = 

P~ 

Up)^- p ni(p) - n i(p)^ p ji(ft) 

The following are properties of both the j’s and the n’s: 


l > 0 


(15.9) 


jt-l(p) + jn-l(p) = ~—^—jl(p), l > 0 

P 

^Jj(p) = 2l~-f i t(7'-i(p) — (Z + l)ji+j(p)] 
d 

jp [p l+1 3i(p)] = p z+1 i^i(p), l > 0 (15.10) 

[p _ yi(p)i = -p- i ji+i(p) 

5ji(p)dp = — jo(p) 

Jio(p)p 2 ^p = p 2 ji(p) 

foi{p)p 2 dp = ip a [j 2 (p) ~ ji-i(p)ji+i(p)] l > 0 

Since R(r) must be finite for r = 0, the desired solution for r < a is 

R(r) = Aj z («r) (15.11) 

Exterior Solutions for Arbitrary l . The wave equation for r > a can 
be put in the form (15.4) if we redefine p to be ipr, where /3 is given in Eq. 
(15.2). Since the domain of p does not now extend in to zero, there is 
1 Equations (15.8) are useful approximations for p somewhat larger than ]l(l + 1) 
(G. N. Watson, op. til p. 199); however, the magnitudes (although not the phases) of 
ji and rii are given to good approximation by (15.8) if p is somewhat larger than l t 
which is approximately the value of p for which the magnitude of ji is greatest. 
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no reason why ni cannot, appear in the solution. The linear combination 
of ji and Ui to be selected will be determined by the asymptotic form, 
which must fall off exponentially for large r. This suggests that we define 
spherical Hankel functions 


WKp) = ji{p) + ini{p) 
W\p) = jt(p) - in t (p ) 


(15.12) 


which from Eqs. (15.8) have the asymptotic forms 

/'i.|»(p)-> I C ilf- 

p —► eo p 

},P(p) -> - e -<[,-Ki+i)rl 

p—► OO p 


(15.13) 


It can be shown that the asymptotic expansions, of which Eqs. (15.13) 
give the leading terms, contain no terms for which the exponent has the 
opposite sign to that given. 

The desired solution for r > a is then 


«(r) = Bh\"(ipr) = + tm(tPr)] (15.14) 

The first three of these functions are 


ww = - j r <^ r 

h\mar) = iQp + ^,) (15.15) 

liX'W) = (jfr + + jgV 3 ) e ~ 0T 


Energy Levels. '1’he energy levels are obtained by requiring that 
(\/Ii)(tUi/<lr) be continuous at r = a. When this condition is applied to 
the interior solution (15.11) and the exterior solution (15.15) with l = 0, 
we obtain Eq. (15.8). This may bo written as 


£ <■«>(, £ = -i), 


£- + 17 2 


2m V o a 1 
h' 1 ~ 


(15.16) 


where, as in See. 9, wo have put £ = era and r\ = j 8a. The same condi¬ 
tion applied to the solutions for / = l reduces, with the help of Eqs. 
(15.0) and (15.15) to 


col, £ 1 1,1 

—z -= - T 

£ £‘ V V 


£ 2 + 17 s 


2m Vna 1 
“A* 


(15.17) 


Equations (15.17) may be solved numerically or graphically, by the 
methods indicated for the solution of Eqs. (15.16) in Sec. 9. Tn general, 
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there is no degeneracy between the eigenvalues obtained from the solution 
of equations like (15.16) and (15.17) for various values of L 

It is easy to see how many energy levels Eqs. (15.17) give for various 
values of Vq<x 2 without going through the numerical work. A new level 
appears whenever rj is zero or cot £ is infinite. This occurs at £ = t, 
2tt> . . . . Thus there is no energy level with 1=1 when 


Y 0 a 2 ^ 


ttW 
2m ; 


there is one bound state with l = 1 if Tr 2 h 2 /2m < Yoa 2 ^ 2nr 2 h 2 /m, etc. 

The smallest value of Yoa 2 for which there exists a bound state with 
l = 1, is greater than the corresponding value of Yoa 2 for l = 0; this is 
reasonable from a physical point of view. The interpretation in Sec. 14 
of the l term in the radial wave equation as an additional potential energy, 
which corresponds to the repulsive “centrifugal force,” suggests that a 
particle possessing angular momentum requires a stronger attractive 
potential to bind it than a particle with no angular momentum. Indeed, 
it turns out that the minimum square well potential “strength” V 0 a 2 
required to bind a particle of orbital angular-momentum quantum num¬ 
ber l increases monotonically with increasing l. 1 


16. THE HYDROGEN ATOM 

The potential energy Y(r) = — Ze 2 /r, which represents the attractive 
Coulomb interaction between an atomic nucleus of charge +Ze and an 
electron of charge — e, provides another wave equation that can be 
solved analytically. This problem is of direct physical interest, since 
apart from relativistic effects (see Chap. XII), the calculated energy 
eigenvalues are in agreement with the observed energy levels of the hydro¬ 
gen atom (Z = 1), the singly charged helium ion (Z = 2), etc. 

Reduced Mass. The Schrodinger wave equation developed in Sec. 6 
describes the motion of a single particle in an external field of force. 
Now, however, we are interested in the motion of two particles (nucleus 
and electron) that are attracted to each other by a force that depends 
only on the distance between them. The form of the wave equation to 
be used for two particles is suggested by the extension of the wave 
equation from one to three dimensions that was given in Sec. 6. This 
extension involved making the wave function depend on the three 
rectangular coordinates x , y , and z instead of just on x } and introducing 

1 It oan be shown that bound states appear with zero energy for a particular l 
value when [(1 /R)(dR/dr)] r -a - -(Z + l)/o; for l > 0 this is equivalent to the 
condition «=* 0 where now {■ « (2mV 0 a 3 /ft*)l. 
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the momenta corresponding to the new coordinates as they appear in the 
classical expression for the energy. 

A similar extension from three to six rectangular coordinates leads 
directly to the Schrodinger wave equation for two particles of masses 
m>i and ra 2 : 


8 ,/ a r ^ . ** , a 2 \ 

= [- 2 ^-^ + w + T z *) 

/ Q 2 Q 2 Qi \ I 

where the potential energy is assumed to depend in an arbitrary manner 
on all six coordinates and the time. If now the potential energy depends 
only on the relative coordinates, so that V = V(xi — x 2} yi — y 2) — z 2 ) f 

an important simplification can be made. We define relative coordinates 
x,y,z and coordinates of the center of mass X,Y,Z by 


x = X\ — x 2j 


V = ?/i - V 2, 


Z = Z\ — z 2 


MX = miXi + m& 2) MY = + rn 2 y 2 , MZ = mi«i + m 2 z 2 

Here, Af = mi + m 2 is the total mass of the system. Equation (16.1) 
can be rewritten in terms of the new coordinates 


= f__^(A s _ + . a4 . + _ 9 ! N ) 
L 2M\dX*^ azy 

- t l(< r L + 

2 m W 


?_ 4 - *1 + i! 

d.r 2 cty 2 cte 1 


3 ) + 


* (16.3) 


= . (16.4) 

mi + m2 

is called the reduced mas, s*. 

Two separations of the wave equation (16.3) can now be made. First, 
the time dependence can be separated out, as in Sec. 8; and second, a 
separation can be made info a product of functions of the relative coordi¬ 
nates and eeiiter-of-mass coordinates. The process is straightforward 
and simple, and results in 

i(N + R')t 

= u{x, y ,z)U(X,Y,Z)e * 

i rr 1.1 . / 1 a E\ 


V £ u + Vu = Eu 


(16.5) 


- rn ™ - E ' u 

where the v 2 operators in the second and third equations imply differ¬ 
entiation with respect to the relative and ccnfcr-of-mass coordinates, 
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respectively. The second of Eqs. (16.5) describes the relative motion of 
the two particles, and is the same as the equation for the motion of a 
particle that has the reduced mass /x in an external potential energy V. 
The third of Eqs. (16.5) tells us that the center of mass of the system of 
two particles moves like a free particle of mass M. 

In the hydrogen-atom problem, we shall be interested in the energy 
levels E associated with the relative motion. In this case, the reduced 
mass n is only slightly smaller than the electronic mass, since atomic 
nuclei are far more massive than electrons. 

Asymptotic Behavior. The separation of the relative motion in 
spherical coordinates is made as in Sec. 14. The radial equation that 
corresponds to the angular-momentum quantum number l is then 


2 d]R\ 
2ii r 2 dr \ dr ) 



1(1 + 1 )h 2 

2 flT 2 


R = ER 


(16.6) 


where E < 0 for a bound state. We follow the polynomial method used 
in the treatment of the harmonic-oscillator equation that was given in 
Sec. 13, and first attempt to rewrite Eq. (16.6) in dimensionless form by 
introducing a dimensionless independent variable p = or. Unlike Eq. 
(13.1), however, where the leading term for large x was the potential 
energy term %Kx 2 , the leading term in Eq. (16.6) for large r is the eigen¬ 
value term E. We therefore choose a so that the E term becomes a 
fixed number; this makes the dominant asymptotic behavior of the solu¬ 
tion independent of the eigenvalue. We rewrite Eq. (16.6) as 


1 ± ( 2 M i A _ 1 
?dp\ p dp)^\p 4 


W±i>) K -o 


(16.7) 


where the particular choice of the number i for the eigenvalue term is 
arbitrary but convenient for the following development. Comparison 
of Eqs. (16.6) and (16.7) shows that 



As with the harmonic-oscillator equation, we first find the dominant 
behavior of R(p) in the asymptotic region p —► oo. For sufficiently 
large p, it is apparent that R(p) = p n 6 >:t ^ satisfies Eq. (16.7) so far as the 
leading terms (which are of order R) are concerned, when n has any 
finite value. This suggests that we look for an exact solution of Eq. 
(16.7) of the form 


R(p) = F{p)c-lo 


( 16 . 9 ) 
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where F(p) is a polynomial of finite order in p. Substitution of Eq. (16.9) 
into Eq. (16.7) gives as the equation for F(p ) 



1(1 + 1) 

P 2 


F = 0 


(16.10) 


where primes denote differentiation with respect to p. 

Energy Levels. We now find a solution for F in the form 


F(p) = p'(a n + flip + Gap 2 + • • • ) 

= p*L(p), Go 0, s ^ 0 (16.11) 

This is necessarily finite for p = 0. Substitution of Eq. (16.11) into Eq. 
(16.10) gives as the equation for L 

p 2 L" + p[2(s + 1) - p]/7 + [p(X - s - 1) + a(* + 1) - Z(i + l)]i = 0 

If p is set equal to zero in this equation, it follows from the form of L 
implied by Eq. (16.11) that s(s + 1) - 1(1 + 1) = 0. This quadratic 
equation in s has two roots: a = l and s = — (i + 1). The boundary 
condition that Rip) be finite at p = 0 requires that we choose s = l. 
The equation for L then becomes 

P L" + [2(1 + 1) - P ]L' + (X - l - 1)L = 0 (16.12) 

Equation (16.12) can be solved by substituting in a power series of the 
form indicated by Eq. (16.11). The recursion relation between the 
coefficients of successive terms of the series is readily seen to be 


v + l + 1 — X 

a ' +1 “ (7+T)(7"+ 2T+2) 


(16.13) 


If the series does not terminate, its dominant asymptotic behavior can 
be inferred from the coefficients of its high terms: 


a » i-i _ > 1 

Cl y j> - > 00 V 

This ratio is the same as that of the series ior p"c f with any finite value oi 
n. Equations (16.9) and (16.11) show that this bohavior for L violates 
the boundary condition on It for largo p. 

Thus the series for L must torminato. If the highest power of p in 
L is p’*'(n' 5: 0), we must choose X equal to a positive integer n, 1 such that 

X -» = «' + f + 1 (16.14) 


1 The result. that the allowed values of X are integers, rat, her than multiples of 
integers, derives from the e.hoiee of i for the eigenvalue term in the dimensionless 
radial wave equation (1G.7). 
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n' is called the radial quantum number and n the total quantum number. 
Since n' and l can take on positive integer or zero values, n can have the 
values 1, 2, . . . . The energy eigenvalues are given by Eq. (16.8) 

*■ - - i b »i - - seS ('«•'*) 

in agreement with the old quantum theory and with experiment. Unlike 
the square well potential problem considered in Sec. 15, the Coulomb 
field problem gives rise to an infinite number of discrete energy levels 
extending from — /*Z 2 e 4 /2A 2 up to zero, for any finite value of Z. This is 
due to the slow decrease in magnitude of the Coulomb potential at large r. 

Laguerre Polynomials. The physically acceptable solutions of Eq. 
(16.12) with X = n may be expressed in terms of the Laguerre poly¬ 
nomials L q (p) y which can be defined in terms of a generating function 

_ pa 

p 1—a 

U( P ,s) = 

DO 

S<1 (16 ' 16) 

Differentiation of the generating function with respect to p and s leads to 
relations that are analogous to Eqs. (13,11) for the Hermite polynomials 
and (14.11) for the Legendre polynomials 


L q qLg—i ~ qL q -i 
L q +1 = (2# + 1 — p)L q — q 2 L q ~i 


(16.17) 


The lowest order differential equation involving only L q that can be 
constructed from Eqs. (16.17) is easily seen to be 


P^q + (1 — p)L q + qL q = 0 (16.18) 

Equation (16.18) resembles Eq. (16.12) but is not quite the same. We 
define the associated Laguerre polynomial 

* 

L?(p) = g,L 5 ( P ) (16.19) 

on differentiating Eq. (16.18) p times, it is seen that L v q (p) satisfies 

pi?" + (p + 1 - p)L*' + (q - p)L* = 0 (16.20) 

Comparison of Eq. (16.12) with X = n and Eq. (16.20) shows that the 
desired polynomial solutions are the associated Laguerre polynomials 
^n+iip) i which are of order (n + 0 — (2Z + 1) « n — l — 1 in agreement 
with Eq. (16.14). 
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Differentiation of Eq. (16,16) p times with respect to p gives the 
generating function for the associated Laguerre polynomials, 


U P ( P ,s) 


(~«) p e = V (p) „ 

(1 - s)^ 1 Af q\ 


(16.21) 


The following explicit expression may be verified by substituting it 
into Eq. (16.21) with n + l = q and 21 + 1 = p, and interchanging the 
order of the two summations: 


W(p) 


n-I-l 

- 2 <- 


i)*+i 


[(n + Oj]V 




(» — l — 1 — /r)l(2Z + 1 + k)\k\ 


(16.22) 


Hydrogen-atom Wave Functions. The radial wave function is of the 
form e~h‘p l L%fa l (p). The normalization constant may be found by using 
the generating function to evaluate the integral 


/> 


c->p»[L*#(p)}ydp = ^^ 1) 1 


(16.23) 


Thus the normalized energy eigenfunctions for the hydrogen atom are 
IlnlmiX = Rnli^Y lm(@ j^) 

S -M - - (Qs)‘ wfot'F -‘ViSS-Cp) (10.24) 


<Z() — 


w 

—y 
lie- 


2Z 

p - •— r 
na o 


where Yi m (0,<t>) is the normalized spherical harmonic given in Eq. (14.16); 
rta is the radius of the first (circular) Bohr orbit for hydrogen (Z = 1) 
in the old quantum theory. The energy levels (16.15) may be written 

ZV 


K = - 


2 a 0 n 3 


The first three radial functions, which arc found from Eqs. (16.22) 
and (16.24), are 


Ru)(r) 


Bao(r) 


Ru(r) 



Zr_ 

2ao 
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A much more complete set of these functions, with graphs of some of 
them, is given by Pauling and Wilson. 1 

It is interesting to note that each of the eigenfunctions for which 
l = 0 has a discontinuous gradient at r = 0, since dR^/dr ^ 0 there 
and Foo is independent of 6 and <j>. This is a consequence of the infinite 
potential energy at that point, as can be shown by means of a limiting 
process similar to that used in Sec. 8 to derive the boundary conditions 
at a perfectly rigid wall. 

Degeneracy. The energy eigenvalues (16.15) depend only on n, and 
so are degenerate with respect to both l and m. Thus for each value of 
ft, l can vary from 0 to n — 1, and for each of these l values, m can vary 
from —l to +Z. The total degeneracy of the energy level E n is then 

V (2 1 + 1) = 2 W(n ~ 1} + n = n* 

1 = 0 

It follows from the discussion of Sec. 14 that the degeneracy with 
respect to m is characteristic of any central force field, for which V 
depends only on the radial distance r from some point. The l degener¬ 
acy, however, is characteristic of the Coulomb field, as distinguished from 
most other central force fields. In some problems, such as the motion of 
the valence electron of an alkali atom, the potential energy of the electron 
is central, but only approximately of the Coulomb form. This pre¬ 
vents the ft energy levels that have the same total quantum number ft 
and different l from being coincident, so that the nth hydrogen-like level 
splits up into n distinct levels. If also some external field (such as a 
magnetic field) that destroys the spherical symmetry is imposed, the 
(21 + l)-fold m degeneracy disappears, and the nth hydrogen-like level 
is split up into n 2 distinct levels. 

The existence of degenerate energy eigenvalues means that linear 
combinations of the corresponding eigenfunctions are solutions of the 
wave equation with the same energy. In the case of the m degeneracy, 
such linear combinations of the spherical harmonics Yi m (6,4>) can be 
found that correspond to a new choice of the polar axis. It is reasonable 
to expect that linear combinations of the degenerate hydrogen-atom 
eigenfunctions that have the same n and different l exist that also corre¬ 
spond to some new choice of the coordinates. This is, in fact, the case, 
since it turns out that the hydrogen-atom wave equation can be .separated 
in parabolic coordinates. In general, degeneracy will occur whenever 
the wave equation can be solved in more than one way (in different 


1 Pauling and Wilson, op. cit Sec. 21. 
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coordinate systems, or in a single coordinate system oriented in dif¬ 
ferent ways), since if there were no degeneracy the wave functions 
obtained in the different coordinate systems would have to be identical 
except for a multiplying constant, and that is usually not possible. For a 
general central field, an exception occurs when l — 0 , since then the wave 
function is spherically symmetric and has the same form for all orienta¬ 
tions of the polar axis, so that there is no degeneracy. A similar excep¬ 
tion occurs in the hydrogen atom problem when n — 1 , in which case 
it turns out that the solutions obtained by spherical and parabolic 
separation of the wave equation are identical. 

Separation in Parabolic Coordinates. The parabolic coordinates 
£,? 7,0 are given in terms of the spherical polar coordinates by the relations 

£ = 7 - — z = r(l — cos 0 ) 

v = r + z = r(l + cos 0 ) (16.25) 

4> = <t> 


The surfaces of constant £ arc a set of confocal paraboloids of revolution 
about the z or polar axis, with focus at the origin, that open in the direc¬ 
tion of positive z or 0 = 0. The surfaces of constant rj are a similar set 
of confocal paraboloids that open in the direction of negative z or 0 = i r. 
The surfaces of const,ant 0 are the same as in the spherical coordinate 
system: planes through the polar axis. 

The wave equation for the hydrogen atom in parabolic coordinates is 



E < 0 (10.26) 


The separation is accomplished by substituting 

v/(£,t?,4>) = /(£)(/(* 7) f M<£) 

into Mq. ( 10 . 20 ) and dividing through by it) the 0 part of the equation 
separates at once: 


€ + nL /'/«v 



^ Wtt + V) 


2v\E\b 

h 1 


1 r/ 2 <h 
r/0 2 


(10.27) 


Since the left side of Iflq. (10.27) depends only on £ and 77 , and the right, 
side only on 0 , both sides must be equal to a constant that, we call m' 2 ; in 
accordance with the discussion of Hec. 14, this gives normalized 0 solu- 
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tion.8 that are the same as (14.8) 


m = 0, + l,±2, . . . (16.28) 

The rest of Eq. (16.27) can be separated into £ and ij parts: 

1 d ( t df\ _m? _ fi\E\ iiZe 1 

4£ 2ft 2 ft 2 

*«•> 

where the separation constant v is to be determined by the boundary 
conditions. Thus the equations for / and g are 


d 

d£ 




(16.30) 


Since these two equations are of the same form, and differ only in their 
constant terms, it is sufficient to solve one of them. 

Energy Levels. The first of Eqs. (16.30) may be solved by the 
method used to solve (16.6). The substitution {* = a£ puts it into the. 
dimensionless form 


11 (, £) + (?s _ I -2l)/. 0 
tdtVtt/ \r 4 vv J 

if we choose the parameters a and Xi to be given by 



(16.31) 


(16.32) 


The second of Eqs. (16.30) is also of the form (16.31) if we put f = ari 
with a given by (16.32); Xi is replaced by 


Xj = (16.33) 

We now treat Eq. (16.31) as we did (16.7). The asymptotic behavior 
is dominated by the factor e* f , where we must take the minus sign in 
the exponent. The series that multiplies this starts with a term £*, 
where it is readily shown that s = ±im. We therefore substitute 

/(f) = erhtpb »“L(f) (16.34) 

into (16.31) and obtain as the equation for L 

fL" + (|m| + 1 - t)L' + [\x - -KM + 1)]L = 0 (16.35) 
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As with Eq. (16.12), the nonterminating solutions for L cause the wave 
function (16.34) to becomes infinite for large {*. The terminating solu¬ 
tions are the associated Laguerre polynomials; comparison of Eqs. (16.20) 
and (16.35) shows that they are where 

tu = \i - i(|m| + 1) (16.36) 


is a positive integer or zero. 

In similar fashion, the solution of the ?j equation shows that the 
number 

n 2 - X 2 - £(|m[ + 1) (16.37) 

is a positive integer or zero. From Eqs. (16.36) and (16.37) we obtain 
Xi + X 2 == i + n 2 + |tm| -|“1 = ti (16.38) 


where n is a nonzero positive integer. The energy levels are given by 
combining Eqs. (16.32), (16.33), and (16.38): 



2 fx 


nZ*c* 
2 h*n* 


in agreement with Eq. (16.15). The energy level E n is degenerate, since 
according to (16.38) there are various ways in which the three quantum 
numbers ni, n 2 , and m can be combined to make up n. For m = 0, there 
are n ways of choosing n\ and n 2 ; for \m\ > 0, there are two ways of 
choosing m (= + |m|), and n — \m\ ways of choosing ni and n 2 . Thus the 
total degeneracy of the energy level E n is 

n-1 

n + 2 y (n — |m|) = n + 2 
i 

in agreement with the earlier result. 

Wave Functions. It is clear from the foregoing discussion that the 
unnormalized hydrogen-atom wave functions in parabolic coordinates are 

Wn,n, m (£,1?,4>) = /,^_ w (a£)/,' n "ii m | 

nZc 2 

“ A*(ni + ». +|m| + 1) 


n(n — 1) — 


n(n 


1)1 

— 1 = n 2 


For a particular energy level E n and magnetic quantum number ra(n > |m|), 
the parabolic quantum numbers ni and n 2 can be chosen such that 
ni + n 2 = n — \m\ — 1; that is, in n — |m| different wayB. Similarly, 
for given n and m, the azimuthal quantum number l in the spherical 
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solution can be chosen such that |m| g l ^ n — 1 , and so also in n — |m| 
different ways. Thus the n — \m\ products of the f and 77 functions are 
linear combinations of the n — |m| products of the r and 6 functions. 

The ground-state energy level provides a particularly simple illus¬ 
tration of the connection between the parabolic and spherical solutions. 
In this case, ni = n 2 = m = 0 , and the parabolic solution is simply 

pZeKt+v) fiZe*r 

e 2111 . Also, n — 1 , l = m = 0 , and the spherical solution is e h% . 

It is apparent from Eq. (16.25) that these two solutions are identical. 


Problems 


1. Apply the Bohr-Sommerfeld quantization rules (see Sec. 2) to the determina¬ 
tion of the energy levels of a harmonic oscillator and of the circular orbits in a hydrogen 
atom. Compare with the results obtained in this chapter. 

2. What is the order of magnitude of the spread of quantum numbers and energies 
of the states that contribute significantly to the oscillating-wave-packet solution for 
the harmonic oscillator? 

3 . Use the generating function for the Hermite polynomials to evaluate 



Un{x)x 2 u m (x)dx 1 


where the u 's are normalized harmonic-oscillator wave functions. 

4 . Use the generating function for the Legendre polynomials to evaluate 


5. Obtain an approximate analytic expression for the energy level in a square well 
potential (l - 0) when V Q a 2 is slightly greater than ir 2 ft a /8m. 

6. Show that for a square well potential the values of Voa 2 that just bind new 

energy levels with an l value greater than zero are given by ft 2 3 2 / 2 m, where the numbers 
z are the nonvanishing solutions of the equation = 0 (see footnote 1, page 80). 

7 . Assume that the interaction between the neutron and the proton that make up 
a deuteron can be represented by a square well potential with a = 2.80 X 10“ 13 cm. 
If the lowest (l =* 0) energy level of the system is —2.18 Mev (million electron-volts), 
calculate Vo in Mev to three significant figures. How does the answer compare with 
that which would be obtained from the approximate formula derived in Prob. 5? 

r 

8. Consider Eq. (14.17) with l = 0 and V(r) = -TV *. Change variables 

_ 

from r to z =■ e 2a } and show that Bessel’s equation results. What boundary condi¬ 
tions are to be imposed on z, and how can these be used to determine the energy levels? 
What is the lower limit to V 0 for which a bound state exists? 

9. Find expressions for the eigenfunctions and energy levels of a particle in a 
two-dimensional circular box that has perfectly rigid walls. 

10. It is shown in Sec. 9 that a one-dimensional square well potential has a bound 
state for any positive Voa 2 , and in Sec. 15 that a three-dimensional square well poten¬ 
tial has a bound state only for Voa 2 > Tr B A a /8m. What is the analogous situation for a 
two-dimensional square well potential? What, if any, is the physical significance of 
these results? 
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11. The Schrodinger equation for a rigid body that is constrained to rotate about a 
fixed axis and that has a moment of inertia I about this axis is 

%n at " 21 

where is a function of the time t and of the angle of rotation about the axis 

What boundary conditions must bo applied to the solutions of this equation? Find 
the normalized energy eigenfunctions and eigenvalues. Is there any degeneracy? 

12. Find the energy levels of a three-dimensional isotropic harmonic oscillator 
(V{r) = J/vr 2 ), by solving the wave equation in cartesian coordinates. What is the 
degeneracy of each level? Show that this equation can alBO be separated in spherical 
and in cylindrical coordinates. 

13. Show that the expectation value of the potential energy of an electron in the 
nth quantum state of a hydrogen atom is —^ 2 e 2 /a t) n 2 . From this result, find the 
expectation value of the kinetic energy. 

14. Find the normalized hydrogen-atom wave functions in parabolic coordinates 
for n = 2, m = 0. Express them ob linear combinations of the corresponding wave 
functions in spherical coordinates. 

16. Discuss the parities, if any, of the liydrogen-atom wave functions in parabolic 
coordinates. 
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solution can be chosen such that |m| g l ^ n — 1, and so also in n — \m\ 
different ways. Thus the n — \m\ products of the £ and rj functions are 
linear combinations of the n — \m\ products of the r and 0 functions. 

The ground-state energy level provides a particularly simple illus¬ 
tration of the connection between the parabolic and spherical solutions. 
In this case, tti = n 2 = ra = 0, and the parabolic solution is simply 

nZeH£+ v ) nZe*r 

e 2ii1 . Also, n = 1, l = m = 0, and the spherical solution is e h * . 

It is apparent from Eq. (16.25) that these two solutions are identical. 


Problems 

1. Apply the Bohr-Sommerfeld quantization rules (see Sec. 2) to the determina¬ 
tion of the energy levels of a harmonic oscillator and of the circular orbits in a hydrogen 
atom. Compare with the results obtained in this chapter. 

2. What is the order of magnitude of the spread of quantum numbers and energies 
of the states that contribute significantly to the oscillating-wave-packet solution for 
the harmonic oscillator? 

3 . Use the generating function for the Hermite polynomials to evaluate 


/: 


UnWxhlmWdx, 


where the u’a are normalized harmonic-oscillator wave functions. 

4 . Use the generating function for the Legendre polynomials to evaluate 


A 


Pi (w)Pi'{w)dw. 


6. Obtain an approximate analytic expression for the energy level in a square well 
potential (l = 0) when V Q a 2 is slightly greater than tt 2 h 2 /8m. 

6. Show that for a square well potential the values of W-i 2 that just bind new 

energy levels with an l value greater than zero are given by fi 2 z i /2m, whore the numbers 
z are the nonvanishing solutions of the equation = 0 (see footnote 1, page 80). 

7 . Assume that the interaction between the neutron and the proton that make up 
a deuteron can be represented by a square well potential with a = 2.80 X lO -13 cm. 
If the lowest ( l — 0) energy level of the system is —2.18 Mev (million electron-volts), 
calculate Vq in Mev to three significant figures. How does the answer compare with 
that which would be obtained from the approximate formula derived in Prob. 5? 

r 

8 . Consider Eq. (14.17) with i = 0 and V(r) - — J'V n . Change variables 

__ r 

from r to z = e 2o , and show that Bessel’s equation results. What boundary condi¬ 
tions are to be imposed on 2 , and how can these be used to determine the energy levels? 
What is the lower limit to V Q for which a hound state exists? 

9. Find expressions for the eigenfunctions and energy levels of a particle in a 
two-dimensional circular box that has perfectly rigid walls. 

10. It is shown in Sec. 9 that a one-dimensional square well potential has a bound 
state for any positive Wj 2 , and in Sec. 15 that a three-dimensional square well poten¬ 
tial has a bound state only for V Q a 2 > tW/ 8m. What is the analogous situation for a 
two-dimensional square well potential? What, if any, is the physical significance of 
these results? 



Skc. 10] 


DISCRETE EIGENVALUES: ENERGY LEVELS 


91 


11. Tim SchrOdingor equation for a rigid body that is constrained to rotate about a 
lixod axis and that has a moment of inertia I about this axis is 

d 2 \]/ 

“ 27 

where is a function of the time t and of the angle of rotation 4> about the axis 

What boundary conditions must be applied to the solutions of this equation? Find 
the normalized energy eigenfunetions and eigenvalues. Is there any degeneracy? 

12. Kind the energy levels of a three-dimensional isotropic harmonic oscillator 
(rut 1 /\r a ), by solving the wave equation in cartesian coordinates. What is the 
degeneracy of each level? Show that this equation can also be separated in spherical 
and in cylindrical coordinates. 

13. Show that the expectation value of the potential energy of an electron in the 
/Uli quantum slate, of a hydrogen atom is — Z 2 e J /a 0 n 2 . From this result, find the 
expectation value of the kinetics energy. 

14. Ki nd the normalized hydrogen-atom wave functions in parabolic coordinates 
for a - 2, m -=* t). Kxprcss them ftB linear combinations of the corresponding wave 
functions in spherical coordinates. 

15. 1 >iscuss the parities, if any, of the hydrogen-atom wave functions in parabolic 
coordinates. 



CHAPTER V 

CONTINUOUS EIGENVALUES: COLLISION THEORY 


Problems for which the energy eigenvalues are continuously dis¬ 
tributed usually arise in connection with the collision of a particle with a 
force field. The method of approach is different from that employed in 
the preceding chapter. There the boundary conditions at great distances 
were used to determine the discrete energy levels of the particle. In a 
collision problem, the energy is specified in advance, and the behavior of 
the wave function at great distances is found in terms of it. This 
asymptotic behavior can then be related to the amount of scattering of 
the particle by the force field. 

As in Chap. IV, the relatively few exact solutions that are obtained 
here are of wider application than might at first seem to be the case, 
since they can serve as foundations for approximate calculations on more 
complicated systems. It is interesting to note that the study of collisions 
is particularly important in connection with atomic nuclei (see Sec. 41) 
where relatively little information can be obtained in other ways. 


17. ONE-DIMENSIONAL SQUARE POTENTIAL BARRIER 
We consider first the one-dimensional collision of a particle with the 

square potential barrier V ( x ) shown in 
Fig. 14. In this problem we are inter¬ 
ested in a particle that approaches from 
the region of negative x and is reflected 
or transmitted by the barrier. In the 
corresponding classical problem, the 
particle is always reflected if its energy 
is less than that of the top of the barrier, 
and always transmitted if its energy is 
greater. We shall see that in the 
quantum problem, both reflection and 
transmission occur with finite proba¬ 
bility for most energies of the particle. Because of the lack of symmetry 
between positive and negative x that is introduced from the beginning, 
it is disadvantageous to deal with solutions that have definite parities, 
and so there is no reason for making V(x) symmetrical about x = 0, as 



Fig. 14. One-dimensional square po¬ 
tential barrier of height Vo and thick¬ 
ness a. 
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was done in Sec. 9. We assume, therefore, that V(x) = 0 for x < 0 and 
x > a, and V f x) = F 0 for 0 < x < a, where Vo is positive. 

Asymptotic Behavior. We are interested in representing a particle 
that approaches from the left with energy E > 0, and may be turned back 
by the potential barrier or penetrate through it. Thus the asymptotic 
behavior (in the regions where V (, x ) = 0) is as follows: for x < 0, we want 
the wave function to represent a particle moving to the left (reflected 
particle) as well as to the right (incident particle); f or # > a, we want the 
wave function to represent only a particle moving to the right (trans¬ 
mitted particle). 

A particle in a force-free region that is moving in a definite direction 
with a definite energy necessarily has a definite momentum, and hence 
can be represented by a one-dimensional momentum eigenfunction 

ipx 

u(x) a e h if the particle is moving in the positive x direction with the 

ipx 

momentum p, and u(x) & e h if the particle is moving in the negative x 
direction with the same energy. Thus since the wave equation in the 
regions where V(x) — 0 is 


h 2 d 2 u 
2m dx 2 


Eu 


our asymptotic solutions are 


u(z) = Ac ik * + Bcr* x , x ^ 0 f 

u(x) = Ce ik *, x § a (1 ' 

where A* = p/h == (2mB/ft 2 )Ms the magnitude of the propagation number. 
The solutions (17.1) arc appropriate for the force-free regions that are 
external to any scattering potential, whether or not it has the simple form 
shown in Fig. 14. 

Normalization. The physical meaning of the coefficients A, B , and C 
can be inferred by substituting (17.1) into the one-dimensional form of 
the probability current density given by Eq. (7.3). 


S(x) « v(\A\ 2 - |£| 2 ), x < 0 
S(x) = t>|C| 2 x > a 


(17.2) 


where v = hk/m is the speed of a particle with propagation number k. 
Since these expressions arc independent of z, the discussion of Sec. 7 
shows that they may be interpreted as the neb flux (positive to the right) 
in the two regions. This interpretation is consistent with the statement 
above that A, B t and C are the amplitudes of the incident, reflected, 
and transmitted wave functions, respectively. 

The absolute normalization of the wave functions (17.1) is unim¬ 
portant for this problem; this is because we are interested only in the 



94 


QUANTUM MECHANICS 


[Chap. V 


ratios of |J3| 2 and |C| 2 to |A| 2 , which are respectively the reflection and 
transmission coefficients for the barrier. It is sometimes convenient, 
however, to normalize the incident wave function to unit flux; this cor¬ 
responds to taking A = 1/vK Such a normalization must not be inter¬ 
preted as indicating that u(x) represents more than one particle; rather 
it means that we choose a large enough number of systems [each described 
by u(x)] that are identical, independent, and nonoverlapping in the 
sense of Sec. 7, so that the total incident flux in all of them is unity. A 
more precise hut sometimes less convenient normalization would assume 
a one-dimensional “box” of length L with periodic boundary conditions, 

and require that J \u(x)\Wx = 1. 

Scattering Coefficients. The character of the solution inside the 
potential barrier depends on whether E is greater or less than Vo. Sup¬ 
pose first that E > Vqj so that we can define a propagation number 
inside the barrier: a = [2m (E — Vo )/A 2 ]h Then the solution inside is 

u(x) = Fe iax + Ger ia *, 0 g x ^ a (17.3) 

The continuity of u and du/dx at x = 0 and x = (i required by the bound¬ 
ary conditions provides four relations between the five coefficients. We 
can eliminate F and 0, and solve for the ratios B/A and C/A . 

B __ (i k 2 - a 2 )( 1 - e 2iaa ) 

A ( , 

C _ 4 U} 

A (/<■ + a) 2 — (k ~ a) 2 C 2iaa 

The absolute squares of the ratios (17.4) are the scattering (reflection and 
transmission) coefficients 



1 + 
1 + 


4 kV 

(. k 2 — a 2 ) 2 sin 2 aa 
(/r 2 — a 2 ) 2 sin 3 aa 


Ak 2 a 2 


i \ E - Vo) 
V\ sin 2 aa 

. VI sin 2 aa 
, ^ 4E(E - Vo) 


(17.5) 


It is readily verified from (17.5) that \B/A\ 2 + \C/A\ 2 = 1 , as would be 
expected. 

Equations (17.5) show that the transmission coefficient approaches 



mFoa'V 1 
2 h 2 ) 


(17.6) 


when the particle energy approaches the energy of the top of the 
barrier (E —> 7 0 ). For increasing E (E > F 0 ), the transmission coefficient 
oscillates between a steadily increasing lower envelope and unity (see Fig. 
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15). There is perfect transmission when aa = 7r, 2*-, . . . ; i.e. } when¬ 
ever the barrier contains an integral number of half wave lengths. 1 
Interference phenomena of this type are well known in the transmission 
of light through thin refracting layers. 



Vo 

I’ia. 16. TranHinirtnion o.neflicienl of u Nqmire Imrrior us a function of partiole energy for 

mFooV* 5 = 8. 

The reflection and transmission coefficients for 0 < E < V Q are most 
easily obtained by replacing a by if3 in Eqs. (17.4), where 


2m(V (1 - A’)] 5 

P L J ' 


The result for the transmission coefficient is 


C 

A 


VI sinh 2 /3a 1 


(17.7) 


This decreases monotonieally from the value (17.0) as E decreases 
below Vd. When /3a » 1, the transmission coefficient (1.7.7) becomes 
very small and is given approximately by 


1C}E(V{) - 

vi " 


E) 


( ,~2fla 


(17.8) 


Figure 15 is a plot of the transmission coefficient computed from Eqs. 
(17.5) and (17,7) for a rather “opaque” barrier: mV^/h* = 8. 


1 Thin effect uIho occurs when F u < 0, in which case the square barrier heroines n 
square well. The seaUering eoeflic.ients are. given by (17.5) iT the sign <>r ho is changed 
there and in the expression for a. 
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18. COLLISIONS IN THREE DIMENSIONS 

We are primarily concerned in this chapter with collisions in three 
dimensions, in which a particle collides with a fixed force field, or two 
particles collide with each other. It was shown in Sec. 16 that the prob¬ 
lem of the nonrelativistic motion of two particles, when the only forces 
present depend on their relative positions, can be broken up into two one- 
particle problems, of which one describes the motion of the particles 
relative to each other or to their center of mass, and the other describes 
the free motion of the center of mass. While the center of mass can be 
taken to be at rest in calculating the energy levels of the internal motion, 
as in Sec. 16, it has a definite motion in a collision that cannot be ignored 
in calculating the outcome of such an experiment. This is because the 
customary laboratory procedure consists in bombarding a particle that 
is initially at rest with another particle that carries the total energy 
E 0 = E + E f of Eq. (16.5). Thus the energy E of the relative motion of 
the two particles is different from the bombarding energy E 0) and the 
observed scattering depends on whether the struck particle or the center 
of mass is initially at rest. 

We call the coordinate system in which the bombarded particle is 
initially at rest the laboratory coordinate system and the coordinate system 
in which the center of mass of the two colliding particles is (initially 
and always) at rest the center-of-mass coordinate system. It is evidently 
easier to calculate the result of a collision experiment in the center-of- 
mass system than in the laboratory system, since only 3 degrees of free¬ 
dom appear in the former as compared with 6 in the latter system. The 
collision process in the center-of-mass system may then be thought of as 
one in which a particle that has the reduced mass ix = 7nim 2 /(mi + m 2 ) 
of Eq. (16.4) and an initial velocity v collides with a fixed scattering 
center [see the discussion of Eq. (18.9) below]. The distribution in angle 
of the scattered particles will be affected by the transformation between 
the center-of-mass coordinate system, in which the calculations are made, 
and the laboratory coordinate system, in which the observations are 
made. 

Scattering Cross Section. The angular distribution of particles 
scattered by a fixed center of force or by other particles is conveniently 
described in terms of a scattering cross section. Suppose that we bom¬ 
bard a group of n particles or scattering centers with a parallel flux of N 
particles per unit area per unit time, and count the number of incident 
particles that emerge per unit time in a small solid angle Acj 0 centered 
about a direction that has polar angles 0 O and <£ 0 with respect to the 
bombarding direction as polar axis. This number will be proportional 
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to N t Uj and Acoq, provided that the flux is small enough so that there is 
no interference between bombarding particles and no appreciable dimi¬ 
nution of the bombarded particles by their recoil out of the target region, 
and provided also that the bombarded particles are far enough apart so 
that each collision process involves only one of them. 

Then the number of incident particles that emerge per unit time in 
Aw 0 can be written 

wiV<ro(0o,0o)Aa>o (18.1) 

where the proportionality factor <r o (0o,0o) is called the differential scatter¬ 
ing cross section. Since (18.1) has the dimensions of reciprocal time, 
0 * 0 ( 00 , 00 ) has the dimensions of an area. o-o(0o,0o)Aaj o is equal to the cross- 
sectional area of the parallel incident beam that contains the number of 
particles scattered into Acoo by a single target particle or scattering center. 
The integral of <ro(0o,0o) over the sphere is called the total scattering cross 
section 

cro ~ /cro(0Q,0o)r?cdo (18.2) 

For the collision of a particle with a fixed scattering center, the 
definition (18.1) of differential scattering cross section is equally valid in 
the laboratory and center-of-mass coordinate systems, since a scattering 
center that is fixed has an infinite effective mass and so the center of 
mass of the system does not move. For a collision between two particles 
of finite mass, however, the differential cross section (18.1) applies in 
general only to the laboratory coordinate system and to the observation 
of the scattered incident particle. It does not describe the observation of 
the recoil bombarded particle in the laboratory system, although it is 
of course possible to obtain a differential cross section for the recoil 
particle from cr o (0 o,0 o ). In the cent er-of-mass system the differential 
cross section <r(0,0) may be defined in analogy with (18.1), where again 
the scattered incident particle is the one that is observed and the flux N 
of the incident particle is computed with respect to the bombarded 
particle, not the center of mass. Since in this coordinate system the two 
particles move in opposite directions away from each other after the 
collision, it is clear that the differential cross section for observation of 
the recoil bombarded particle in the direction 0,0 is just o-(t — 0, 0 + x). 

Relations between Angles in the Laboratory and Center-of-mass 
Systems. The relation between the differential cross sections and angles 
in the laboratory system and in the center-of-mass system can be found 
by translating the laboratory system in the direction of the incident 
particle with sufficient speed to bring the center of mass to rest. Figure 
16a shows a particle of mass mi and initial speed v striking a particle of 
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mass m 2 that is initially at rest; the center of mass moves to the right with 
the speed v f = + m 2 ), as may be seen from the conservation of 



mx+m2 




Fig. 10. (a) Laboratory coordinat-o system, in which the target particle of moss ma is 
initially at rest. ( b ) Center-of-mass coordinate system, in which the center of mans is 
initially and always at rest, (c) Vector addition of velocity of center of mass in laboratory 
system (vO to velocity of observed particle in ccnter-of-masH system (v") to Rive velocity 
observed in laboratory system (vi); if v" < v do cannot exceed the hurIc sin -1 (v"/V)- 


momentum. Thus in the center-of-mass system the particles of masses 
mi and m 2 approach the center of mass with speeds 


V N = V — V 1 


m 2 v 

(mi + m 2 )’ 


and v r , respectively; they evidently recede from the center of mass after 
the collision with the same speeds [see Fig. 16(6)]. It follows from the 
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geometry of the situation that 0 and 0 are related to 0 O and 0 O by 

v" cos 0 + v f = Vi cos 0 O 
v" sin 0 = v i sin 0 O 
0 = 00 


(18.3) 


From the first two of Eqs. (18.3) we obtain on elimination of 


tan 0 o 


sin 0 

7 + cos 0 7 


xf __ m\ 
v" ~~ ra 2 


(18.4) 


Equations (18.3) and (18.4) can be generalized by considering a col¬ 
lision (for example, a nuclear reaction) in which a particle of mass mi 
strikes a particle of mass m 2 initially at rest, and after the collision, 
particles of masses m 3 and m\ emerge, where mi + m 2 = m a + m 4 . 
If also an amount of energy, Q , is converted from internal energy to 
kinetic energy of the emergent particles (Q is positive for exothermic and 
negative for endothermic collisions), and the particle of mass m a is 
observed, the first of Eqs. (18.4) is still valid. In this case y is still equal 
to the ratio of the speed of the center of mass in the laboratory system 
to the speed of the observed particle in the ccnter-of-mass system. How¬ 
ever, 7 is no longer rrii/m a, but can bo shown to be given by 


m rm 3 E V 

m 2 m.i E + QJ 


(18.5) 


where# = 7ni?nov ,2 /2(ni x + m 2 ) is the energy initially associated with the 
relative motion in the center-of-mass system [see the discussion of Eq. 
(18.9) below]. 

Relation between Cross Sections. The relation between the cross 
sections in the laboratory and center-of-mass coordinate systems can be 
obtained from their definitions, which imply that the same number of 
particles arc scattered into the differential solid angle dw (l about 0 o, 0 o as 
are scattered into dco about 0 , 0 . 


<Tn( 0 o, 0 n) sin 0 l) d 0 ( ) d 0 u = <r( 0 , 0 ) sin dd0d<j) (18.6) 


With the help of the last of Eqs. (L8.3) and the first of Eqs. (18.4), Eq. 
(18.6) gives 


O"n(0ii,0n) = 


(1 + y‘ z + 2 7 cos 0 ) 


| L + 7 cos 0 | 


- <7(0,0) 


(18.7) 


where in general 7 is given by Eq. (18.5). It should be noted that the 
total cross section is the same for both laboratory and center-of-mass 
systems, and also for both the outgoing particles, since the total number 
of collisions that take place is independent, of the mode of description of 
the process. 
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Dependence on 7 , For 7 < 1, Eq. (18.4) shows that 0 O increases 
monotonically from 0 to t as 0 increases from 0 to t. For 7 = 1 , 0 O = -J -0 
and varies from 0 to as 0 varies from 0 to 71 -; in this case 

cro(0o,<M = 4 cos 0o<7(20o, <M, 

and no particles appear in the backward hemisphere in the laboratory sys¬ 
tem. For 7 > 1 , 0 O first increases from 0 to a maximum value sin“ 1 (l/ 7 ), 
which is less than ^tt, as 0 increases from 0 to cos -1 ( — 1 / 7 ); 0 O then 
decreases to 0 as 0 increases further to 7 r. In this case o- o (0o,tf>o) is usually 
infinite at the maximum value of 0 O , although this singularity gives a 
finite contribution to the total cross section; no particles appear beyond 
the maximum 0 O in the laboratory system. The two values of 0 that 
give rise to a particular value of 0 O between 0 and sin " 1 (I/ 7 ) can be 
distinguished by the energy of the observed particle, which is greater 
for the smaller 0 . 

This last case (7 > 1) is illustrated schematically in Fig. 16c. The 
resultant of the velocity v" of the observed particle in the center-of-mass 
system and the velocity v' of the center of mass in the laboratory system 
gives the velocity Vi of the observed particle in the laboratory system. 
The locus of the terminal points of Vi when its origin is at the point 0 
is the circle of radius v ". Thus when v" < v' } the angle 0 O of the resultant 
Vi with the bombarding direction cannot exceed the angle 



As the ratio 7 = v f fv n decreases, the circle gets relatively larger and the 
angular range of Vi increases. 

The use of geometrical relationships in the foregoing discussion is 
valid in a quantum-mechanical system as well as in a classical system. 
This is because they are essentially relations between momentum vectors 
that are applied in the asymptotic region where the particles need not be 
precisely localized in space and hence can have definite momenta. 

It is interesting to note that the difference between laboratory and 
center-of-mass systems is negligible in the collisions of electrons with 
atoms, because of the large mass ratio of the colliding particles. In 
nuclear collisions, however, the difference between the two coordinate 
systems is usually significant. 

Asymptotic Behavior. The differential scattering cross section 
o*( 0 , 0 ) in the center-of-mass coordinate system can be found from the 
asymptotic form of the solution of the second of Eqs. (16.5), 
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which is the wave equation for the relative motion. The wave function 
u may be written as a function of the angles 6,<f> of Fig. 16(6) and the radial 
distance r between the two particles. From Eq. (16.4) the reduced maps 
is n = wiim 2 /(m 1 + m 2 ). The energy E associated with the relative 
motion is easily seen from Fig. 16 to be 


E = 


—So 
m i + m 2 


(18.9) 


where E 0 is the initial energy of the bombarding particle. It is interesting 
to note that E is equal to the kinetic energy of a particle whose mass is 
the reduced mass /z and whose speed is the relative speed v. Thus we can 
think of Eq. (18.8) as representing the collision of a particle of mass /x, 
initial speed v, and kinetic energy E = -JjLiy 2 , Avith a fixed scattering center 
that is described by the potential energy 7(r); then r is the vector distance 
from the fictitious particle /\x to the origin of the scattering potential. 

As in Sec. 17, the scattering is determined by the asymptotic form of 
w(r,0,<f>) in the region where 7 = 0. When the colliding particles are 
far apart, we Avant u to contain a part that represents an incident particle 
of mass ix moving in a particular direction with speed v } and a part that 
represents a radially outgoing particle: 


u(r,d,4>) -> A [<!»■ + r- , M*)«“ r L A- = £ (18.10) 

r—► oo fl 

The first term in Eq. (18.10) represents a particle moving in the positive 
z direction, or along the polar axis 0 = 0, since z = r cos 0; it is an infinite 
plane wave of the form of the momentum eigenfunction (11.2), where the 
propagation vector k has the magnitude k and is directed along the polar 
axis. The second term in Eq. (18.10) represents a particle that is moving 
radially outward; its amplitude depends on 0 and <£, and is inversely 
proportional to r since the radial flux must fall oil as the inverse square 
of the distance. It is readily verified that Eq. (18.10) satisfies the wave 
equation (18.8) asymptotically through terms of order l/r in the region 
in which 7 = 0, for any form of the function /(0,<£). 

Normalization. The physical meaning of the coefficient A and the 
angular function / can be inferred from a calculation of the particle flux, 
as in Sec. 17. A straightforward substitution of Eq. (18.10) into Eq. 
(7.3), however, yields interference terms between the incident and 
scattered waves that do not appear in most experimental arrangements; 
that they do not appear can be seen in the following way. 

In practice, the incident and scattered particles are separated from 
each other by collimating one or the other. Suppose, for example, that 
the experimental arrangement is as shown schematically in Fig. 17, so 
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that the bombarding particles from the source S are collimated by dia¬ 
phragms DD into a fairly well-defined beam. Such a collimated beam 
is not an infinite plane wave of the form e**, but can be made up 
by superposing infinite plane waves that have propagation vectors of 
slightly different magnitudes and directions. The total angular spread 
in radians will be of the order of the ratio of the wave length of the particle 
to the diameter of the collimating aperture, and so can be made extremely 
small in a practical case. Now/usually does not vaiy rapidly with angle, 
so that the small directional spread of the incident propagation vectors 
does not affect / significantly. Thus at the point of observation, P, only 
the/term is present, and it is essentially the same as that which appears 
in Eq. (18.10). The incident flux can be calculated from the plane wave 
term alone, since if we go far enough from the scattering region, the / 



Fig. 17. Sohematic diagram of a laboratory arrangement for the measurement of scatter¬ 
ing, in which there is no interference between the incident and scattered waves at the point 
of observation P. 

term can be made negligible. Thus in the region of observation, the 
interference terms are a consequence of the idealization implicit in assum¬ 
ing an infinite plane wave in Eq. (18.10), and usually have no physical 
significance. 1 

Substitution of the two terms of Eq. (18.10) separately into Eq. (7.3) 
shows that the incident flux is of magnitude A | 2 along the polar axis, 
and that the leading term in the scattered flux is of magnitude 

v\Ame,<t>)\*/r* 

along the outward radius. From the definition of cross section, it follows 
that 

= l/(M)l* (18.11) 

As discussed in Sec. 17, the choice of the coefficient A is unimportant 
so far as the calculation of the scattering is concerned. The wave func¬ 
tion may be normalized to unit incident flux by choosing A = 1/W, or it 
may be normalized by making f\u\ 2 dr = 1 over a large box that has 

1 For a somewhat exceptional case, see the discussion of TCq. (10.24) in the next 
section. 
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periodic boundary conditions. We shall often simply set A equal to 
unity. 

19. SCATTERING BY SPHERICALLY SYMMETRIC POTENTIALS 

The asymptotic behavior of the wave function determines the dif¬ 
ferential scattering cross section, but cannot itself be found without solv¬ 
ing the wave equation (18.8) throughout all space. As was the case with 
the calculation of energy levels considered in Chap. IV, this can be done 
only when the wave equation is separable, and a particular case of great 
physical interest is that in which the potential energy is spherically 
symmetric. We assume here that V is a function only of r, and find the 
connection between the solutions separated in spherical polar coordinates 
and the asymptotic form (18.10); this procedure is called the method of 
partial waves. 

In the remainder of this chapter we shall not, for the most part, 
distinguish between collisions of a particle with a fixed scattering center, 
and collisions between two particles treated in the center-of-mass coordi¬ 
nate system. 

Asymptotic Behavior. It is apparent that the problem now possesses 
symmetry about the polar axis, so that u, /, and a- are independent of the 
angle 0. The general solution of Eq. (18.8) then has the form (see Sec. 
14) 

□0 00 

u(r,d) = Y Ri(r)Pi(co$ 6) = Y r _1 xi(r)Pi(cos 6) (19.1) 

i-o z=o 


where Pi is the Legendre polynomial of order Z, and %i satisfies the equation 



- U(r) - 


1(1 + 1 ) 


U(r) = 


2 nV(r) 
h 2 


Xi = 0 
-> 0 


(19.2) 


The boundary condition at r = 0 that Hi be finite or %i vanish determines 
the asymptotic form of the solution (19.2) except for an arbitrary multi¬ 
plicative constant. 

In order to find the general nature of this asymptotic behavior, we 
consider r to be so large that the If and l terms in Eq. (19.2) can be 
neglected. Then the solution of Eq. (19.2) is one of the forms e ±ifcr . To 
get a better approximation, we put 

Xi(r) = v(r)c±*r (19.3) 


where v is assumed to be a slowly varying function of r for large r. Sub¬ 
stitution of (19.3) into (19.2) gives 
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v" ± 2 ikv' = [V(r) + v (19.4) 

where primes denote differentiation with respect to r. If now we neglect 
v” in comparison with Jcv\ since v is supposed to be slowly varying, Eq. 
(19.4) can be integrated to give 

±2ik In v = J j^?7(r) + ' • j dr + constant (19.5) 

Thus if U(r) falls off more rapidly than 1/r for large r, the integral on the 
right side of (19.5) converges and v approaches a constant as r becomes 
infinite. This justifies the assumption made above that v is slowly 
varying. The Coulomb field, for which U is proportional to 1/r, thus 
requires special attention, and will be discussed in Sec. 20. 

The asymptotic form of xi(r) can then be written quite generally 

Xi(r) -> A\ sin (At + 5j), (19.6) 

V'~+ oo 

where thus far and Sj can be complex. The solution of (19.2) that 
vanishes at r = 0 is unique except for a multiplying constant. It can 
be shown that this solution is real everywhere if it starts out to be real at 
r = 0, since A*, U , and l are all real. Thus 8[ must be real, although A\ 
need not be. This being the case, it is readily verified that the total 
radial flux of particles through a large sphere vanishes: 

lim 27rr 2 f* S r sin 9d6 = 0 

r—► oo J® 

where S r is the radial component of the vector (7.3) calculated by sub¬ 
stituting u(r f 6) from Eq. (19.1) into it. This means that there are no 
sources or sinks of particles present, and the particles that are scattered 
radially outward are supplied by the incident plane wave. 

Scattering Cross Section. It is convenient to redefine the amplitude 
Ai and phase angle that appear in Eq. (19.6) in terms of a somewhat 
more specialized problem. It will be assumed that U(r) can be neglected 
for r greater than some distance a ; in cases of practical interest, a may be 
small enough so that the l term in (19.2) is not negligible. For r > a, 
the most general form for Ri(r) that is real (except possibly for a complex 
multiplying constant) is shown in Sec. 15 to be 

Ri(r) = A 2 [cos &iji(kr) — sin ^(Ar)] (19.7) 

where 8i is real; according to (15.8), this has the asymptotic form 
Ri(r) -» (kr)~ l Ai sin (At — %It + <5 z ) 

J-- 00 

Equations (19.6) and (19.8) agree if Ai = kA\ and 8 t = 5j + ?lir. 


(19.8) 
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We now wish to identify the asymptotic form of (19.1) with (18.10). 
To do this, we require an expansion of e ihz = 003 6 in Legendre poly¬ 

nomials: 1 

as 

— y (21 + l)i l ji(kr)Pi(cos 9) (19,9) 

z-o 

Substituting the asymptotic form of (19.9) into (18.10) with A *= 1, 
and equating this to the asymptotic form of (19.1), we obtain 


£ (21 + ain (At - ih)Pt( cos 6 ),+ r~'j(d)e* r 

Z-0 


oo 



A\(kr)~ l sin (At — %hr + 8{)Pi (cos 6) 


When the sine functions are written in complex exponential form, the 
coefficients of e ihr and of e~ ikr on the two sides of this equation must be 
equal to each other: 


00 OO 

2 ikf(6) + y (21+ l)i l e~ ‘^/ J ,(coa 6) = V Aje^'-^PKcos 0) 

Z-0 Z-0 

CO 00 

y (2 1 + l)i l et ilr Pi(cos 8) = Y Aie~ i{Sl ~ V t) Pi (cos 6) 

Z=0 Z-0 


(19.10) 


Since these are true for all values of 6 and the Legendre polynomials are 
orthogonal to each other, the second of Eqs. (19.10) becomes 


ri, = (2 1 + 1 )iV* 

Substitution of this into the first of Eqs. (19.10) gives for the scattering 
amplitude 

oo 

f(6) = (2<fr)“ l y (2 1 + 1 )(c«* - l)i J z(cos 9) (19.11) 

z=o 

Thus the differential cross secrion is 


*i<» = i/wi 2 = i 


(21 + !)<:“' ain 5(P;(coa 0) 


(19.12) 


1 G. N. Watson, “Tlxsory of UcshcI Functions,” rev. od., p. 128 (Macmillan, Now 
York, 1944). 
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Because of the orthogonality of the Legendre polynomials, the total cross 
section contains no products of factors involving different values of l : 

oo 

c = 27t f <t(6 ) sin 8dB = V (21 + 1) sin 2 5* (19.13) 

* =o 

Phase Shifts. The angle is called the phase shift, of the Ztli partial 
wave, since according to (19.8) it is the difference in phase between the 
asymptotic forms of the actual radial function Ri(r), and the radial func¬ 
tion ji(kr) in the absence of scattering potential (U = 0). The phase 
shifts completely determine the scattering, and the scattering cross section 
vanishes when each of the 8i is 0 ° or 180°. 

It should be noted that the derivation of (19.11) is valid whether or not 
there exists the assumed radius a beyond which U(r) is negligible, pro¬ 
vided that U(r) falls off more rapidly than 1/r. However, the method 
of partial waves is most useful for computing scattering cross sections if 
such a radius a does exist, especially if ka is of the order of or less than 
unity. The reason for this is that the first and largest maximum of 
ji(kr) lies roughly at r = l/k, and that for r much smaller than this, ji is 
small and increases about as r 1 [see Eq. (15.7)]. Thus if a < 3 C l/k, ji will 
be very small where U is appreciable; then the Zth partial wave will 
hardly be affected by the potential, the phase shift 8i will be very small, 
and the contribution to the scattering from that l will be negligible. It 
follows then that the scattering cross section consists of a series of terms 
extending from l = 0 to a maximum l that is of the order of ka. Since 
the computation of the phase shifts is usually a tedious affair, the smaller 
the magnitude of ka the easier the method is to apply. Thus this method 
of partial waves is most useful at low bombarding energies. 

It is interesting to note that the classical distance of closest approach 
of a free particle of mass g, velocity v , and angular momentum Ih to the 
origin is lh/pv = l/k. Thus the foregoing remarks are analogous to the 
statement that a classical particle is not scattered if it has sufficient 
angular momentum so that it does not enter the potential region r < a. 

Calculation of 8 t . The phase shift 8 t is computed by fitting the radial 
wave function Ri(r) for r < a, which may have an analytic form and can 
always be found numerically if necessary, to the exterior solution (19.7). 
The boundary condition at r = a is that (\/Ri){dRi/dr) be continuous. 
Thus if 7 1 is the ratio of slope to value of the interior wave function, we 
have that 

k[ji(ka) cos 8i — n[(ka) sin fr ] _ 
ji(ka) cos 8i — ni(ka) sin 8i ^ 1 
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where the derivatives j\ and n\ may be rewritten with the help of (15.10). 
Then 8 t is given by 


tan 5z = 


kj[(ka) - yijijka) 
kn r i(ka) — 7 ini(ka) 


(19.14) 


Equation (19.14) can be used at once to obtain an approximate expres¬ 
sion for 8 i when l » ka and 8 i is expected to be small. In this case, 7 1 
will differ little from the ratio of slope to value of the solution in the 
absence of a scattering potential, so that we put 


7 1 


= k\i 
in 


j'i(ka) 

ji(ka) 


+ €Z 


M « 


fi(ka) 


ji(ka) 


Equation (19.14) can be rewritten with the help of (15.9) as 

€i(ka) 2 ji(ka) 


tan 81 = 


€z(A*a) 2 jz(/ra)nz(A-a) — 1 


(19.15) 


(19.16) 


which is still exact. If now we make use of the power series expansion 
for ji from (15.7) when ( ka ) 2 , and use (15.7) and (15.8) to estimate 

the order of magnitude of n h the inequality in (19.15) becomes 




(19.17) 


and (19.16) may be approximated as 


^__ 

- [1 ■ 3 ■ 5 • • ■ ( 2 1 + l )] 2 


ez2 2Z (Z!) 2 (A*a ) 21+2 

m + 1)!] 2 


(19.18) 


Equation (19.18) can be used to verify the convergence of the sums 
over partial waves such as appear in (19.11). We use Stirling's formula 
to find the leading terms in In \ 8 i\ when l is large, and neglect terms of 
order In l and lower. 

In \ 8 [\ = In |e*| + 2 l[ln (ka) + 1 — In 2 ] — 21 In l (19.19) 

Thus even if \ei\ lias the maximum value indicated by (19.17), 81 falls off 
like the inverse factorial of l (faster than exponentially), and the series 
that appear in the expressions for the scattering converge quite rapidly 
for large Z. 

Relation between Signs of 81 and V(r). It is apparent from (19.18) 
that when l » (ka) 2 , 81 has the opposite sign from e*. If now the potential 
energy term V or U is positive, corresponding to forces that are mainly 
repulsive, Eq. (19.2) shows that the ratio of curvature to value for the 
radial wave function is more positive than in the force-free case. This 
means that the ratio of slope to value is more positive at r = a than is 
the case if U = 0 . Thus a repulsive potential makes ei positive and 81 
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negative. A negative phase shift means that the radial wave function is 
“pushed out” in comparison with the force-free wave function. 

In similar fashion, we see that a negative potential makes negative 
and di positive. This means that the radial wave function is “pulled 
in” by the attractive potential. 




Fig. 18. Schematic plots of the effects of (a) positive (repulsive) potential, and (6) nega¬ 
tive (attractive) potential, on the force-free radial wave function ji(kr)\ the range of the 
potential is a in each case. Ri(r) is drawn arbitrarily to start out like ji(kr) at r — 0, and 
is bent up more rapidly in (a) so that it has a greater amplitude and a retarded phase 
(pushed out) with respect to ji(kr). In (6), Ri(r) bends over sooner, and thus has a smaller 
amplitude than ji(kr) and an advanced phase (pulled in). The amplitudes have no direct 
physical significance, whereas the phases determine the scattering. The difference between 
neighboring nodes of ji and Ri is not precisely equal to the phase shift divided by k (as 
indicated) until iz has gone through several oscillations and attained its asymptotic form. 

These conclusions are valid even when l is not large compared to ka 
and Si is not small. This may be seen graphically by comparing ji(kr) 
and Ri(r) when they are arbitrarily made to start out in the same way at 
r = 0. Figure 18(a) shows a schematic comparison for positive V f and 
Fig. 18(6) for negative V. 

Ramsauer-Townsend Effect The construction in Fig. 18(6) suggests 
that an attractive potential might be strong enough so that one of the 
radial partial waves is pulled in by just half a cycle and its phase shift is 
180°. If this were the case, the corresponding term in the expression 
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(19.11) for f(6) would vanish, and there would be no contribution to the 
scattering. It is clear from the foregoing discussion that the phase shift 
is largest for l = 0. The possibility then arises that ka can be small 
enough and the attractive potential large enough in magnitude so that 
So = 180° and all other phase shifts are negligibly small. In such a case, 
the scattered amplitude /(0) vanishes for all 0, and there is no scattering. 

This is the explanation 1 of the Ramsauer-Townsend effect , the ex¬ 
tremely low minimum observed in the scattering cross section of electrons 
by rare-gas atoms at about 0.7 electron-volt bombarding energy. 2 A 
rare-gas atom, which consists entirely of closed shells, is relatively small, 



Fig. 19. Schematic plot of the effect of the potential of a rare-gas atom of “radius” a on the 
Z = 0 partial wave of an incident electron that has the minimum cross section observed 
in the Ramsauer-Townsend effect. As in Fig. 18, the actual and forco-free wave functions 
start out in the same way at r = 0; the former is “pulled in” by 180° of phase. In an 
actual case, the quantity ka would be somewhat smaller than is indicated here. 

and the combined force of nucleus and atomic electrons exerted on an 
incident electron is strong and sharply defined as to range. Thus it is 
reasonable to expect that a situation such as that illustrated in Fig. 19 
could occur. Here the partial wave with l = 0 has exactly a half cycle 
more of oscillation inside the atomic potential than the corresponding 
force-free wave, and the wave length of the electron is large enough in 
comparison with.a so that higher l phase shifts arc negligible. It is clear 
that this minimum cross section will occur at a definite energy, since the 
shape of the wave function inside the potential is insensitive to the rela¬ 
tively small bombarding energy whereas the phase of the force-free wave 
function depends rapidly on it. 

Physically, the Ramsauer-Townsend effect may be thought of as a 
diffraction of the electron around the rare-gas atom, in which the wave 
function inside the atom is distorted in just such a way that it fits on 

1 This explanation, suggested by N. Bohr, was shown to be quantitatively reason¬ 
able by H. Faxdn and J. Iloltsmark, Zeits. f. Physik, 45, 307 (1927). 

2 The experimental results arc summarized by R. Kolia,th, Phys. Zrils., 31, 985 
(1931). 
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smoothly to an undistorted wave function outside. This effect is anal¬ 
ogous to the perfect transmission found at particular energies in the one¬ 
dimensional problem considered earlier [see discussion of Eq. (17.5)]. 
Unlike the situation in one dimension, however, the Ramsauer-Townsend 
effect cannot occur with a repulsive potential, since ka would have to be 
at least of order unity to make <5 0 = —180°, and a potential of this large 
range would produce higher l phase shifts. 

Scattering by a Perfectly Rigid Sphere. As a first example of the 
method of partial waves, we compute the scattering by a perfectly rigid 
sphere, which is represented by the potential V(r ) = + 00 for r < a, 
and V(r) =0 for r > a. The solution for r > a is just Eq. (19.7). The 
boundary condition, obtained in Sec. 8 , that u(a,9 ) = 0, is equivalent to 
the requirement that all the radial functions vanish at r = a. The phase 
shifts may then be obtained by setting either Ri(a) given by (19.7) equal 
to zero, or 7 1 in (19.14) equal to infinity: 

8 * -I® (19 - 20) 

The calculation of the scattering is particularly simple in the low- 
energy limit: ka = 27ra/X « 1. Then substitution of (15.7) into (19.20) 
gives as an approximation for the phase shifts 

tan Si Si - ( 2 Z + i)[i . l - 5 • • • (2 7 - 1)]* (19.21) 

Thus Si falls off very rapidly as l increases, in agreement with (19.19). 
All the phase shifts vanish as k —»0; however, the l = 0 partial wave 
gives a finite contribution to the scattering because of the factor 1 /A * 2 that 
appears in (19.12) and (19.13). We thus obtain 

<r(9) ^ a 2 , a* ~ 47 ra 2 (19.22) 

The scattering is spherically symmetrical, and the total cross section is 
four times the classical value. 

In the high-energy limit (ka ^>1), we might expect to get the classical 
result, since it is then possible to make wave packets that are small in 
comparison with the size of the scattering region, and these can follow 
the classical trajectories without spreading appreciably. This corre¬ 
sponds to the ray limit in the wave theory of light or sound. The dif¬ 
ferential scattering cross section is rather difficult to find, and we only 
indicate the computation of the leading term in the total cross section. 
Substitution of (19.20) into (19.13) gives 
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_ 4x V (2 1 + l)jf(ftg) 
jf(*a) + «J(*o) 

1 = 0 


(19.23) 


We can make use of asymptotic expansions of Bessel functions that are 
valid when the argument is large and the order is smaller than, of the 
order of, and larger than the argument . 1 The calculation shows that 
most of the contribution to the sum in (19.23) comes from 


l < (ka) - C(ka)* t 

where C is a number of order unity; the leading term here is i(ka) 2 . 
The other two parts of the sum, for (ka) — C(ka )* < l < (ka) + C(ka )*, 
and for l > (ka) + C(ka )*, each contribute terms of order (ka)*, and hence 
may be neglected in the high-energy limit. Thus 

<r S 27ra 2 (19.24) 


which is twice the classical value. 

The reason for the apparently anomalous result (19.24) is that the 
asymptotic form of the wave function is so set up in Eq. (18.10) that in 
the classical limit the scattering is counted twice: once in the true scatter¬ 
ing (which turns out to be spherically symmetric as it is in the classical 
problem), and again in the shadow of the scattering sphere that appears 
in the forward direction, since this shadow is produced by interference 
between the incident plane wave e ikz and the scattered wave f(B)e' ijBT /r. 
However, so long as ka is finite, diffraction around the sphere in the for¬ 
ward direction actually takes place, and the total measured cross section 
(if the measurement can be made so that it includes the strong forward 
maximum) is approximately 27ra 2 . 

Scattering by a Square Well Potential. As a second example of the 
method of partial waves, we consider the somewhat more complicated 
problem of the scattering from the spherically symmetric square well 
potential illustrated in Fig. 13 of Sec. 15. The interior (r < a) wave 
function that is finite at r = 0 is seen by analogy with Eq. (15.11) to be 


Ur) = 1U(ar), a = ( 19 . 25 ) 

Thus the phase shifts are given by Eq. (19.14), where the ratio of slope 
to value of the Zth partial wave at r = a is 


_ aj'iiaa) 

iUa) 


(19.26) 


1 Watson, op. at Chap. VIII. 
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In the low-energy limit (ka <3C 1), substitution of (15.7) into (19.14) 
gives for the first two phase shifts 


tan $ 0 = — 


yoka 2 
1 + 7oCt 


tan b i 


^ (fra) 3 1 - 7i« 

3 2 + yitt 


(19.27) 


Unless yoG = — 1 or yia = —2, both of these vanish as k —> 0. As with 
the rigid sphere, however, the l = 0 partial wave gives a finite contribu¬ 
tion to the scattering because of the factor 1/A* 2 that appears in (19.12) 
and (19.13). From Eq. (19.26), we see that y Q a = aa cot aa — 1, so that 

a & W ^1 - (19.28) 

The scattering is spherically symmetrical. 

The conclusion reached here and in connection with the rigid sphere 
that the low-energy scattering is substantially independent of bombard¬ 
ing energy and angle of observation is almost always valid for any 
potential that has a finite range. Exceptions can arise, as pointed out 
after Eq. (19.27), if any one of the yi is such that the denominator of 
the expression for tan 5 Z is very small. In such a situation, the Zth partial 
wave is said to be in resonance with the scattering potential; then it usu¬ 
ally dominates the scattering. 

Resonance Scattering. An approximate expression for the resonance 
cross section can be obtained by making use of the fact that yi decreases 
linearly with increasing a when a is sufficiently close to a 0 = (2/x7o/ft 2 )*- 
Increasing a causes the interior wave function to bend over more rapidly 
and so decreases the ratio of slope to value at r = a. Now 


a 


(«§ + F)» S a 0 + P- 

AOlp 


when k is small, so that we can write to lowest order in k 

y ia = y\a — bi(ka) 2 


where 7? is the value of 7z when a = a 0 , and bi is a positive number of 
order unity. 1 Substitution into (19.27) and then into (19.12) gives for 
the leading term in the differential cross section, in the two cases for 
which the value of Z for the partial wave that is in resonance is 0 and is 1, 


1 It can be shown that the Zth partial wave is exactly in resonance at zero bombard¬ 
ing energy when 7 °a = — (l -h 1 ) ; in this case bi — § for all 1. Compare with footnote 
1, page 80. 
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a 2 

(To - bok 2 a 2 ) 2 + (*a) 2 ’ 
9a 2 cos 2 6 (ka) 4 
(fi ~ bik 2 a 2 ) 2 + (ka) 6 ’ 


l = 0 (19.29) 
l = 1 (19.30) 


We have put f 0 = y2a + 1 and fi = 7 ?a + 2; for resonance, |f 0 | and |fi| 
are small compared to unity. It is easy to show that (19.29) is a mono- 
tonically decreasing function of ka ; however (19.30) has a maximum at 
ka = (f i/bi)* if f i is positive, and decreases monotonically with ka if 
f i is negative. 

If we make use of the relation f 0 = a 0 a cot a 0 a, we see from (19.29) 
that a suitable approximation for the total cross section when the 1 = 0 
partial wave is in resonance is 


G 


4 7T 


k 2 + a 2 cot 2 au a 


(19.31) 


It is apparent that the l = 0 partial wave is in resonance at low bombard¬ 
ing energies whenever a Q a is approximately an odd multiple of 7 r/ 2 , so 
that V 0 a 2 = ir% 2 /8/z, OttW/S/i, etc. The discussion of Eq. (15.3) shows 
that these are just the values of Voa 2 for which new energy levels with 
l = 0 appear. It is true quite generally that a potential well (not neces¬ 
sarily square) that has an energy level nearly at zero exhibits a resonance 
in the low-energy scattering of particles with the same l value (not neces¬ 
sarily zero) as the energy level. From a physical point of view, we can 
say that an incident particle that has nearly the right energy to be bound 
by the potential tends to concentrate there and produce a large distortion 
in the wave function and hence a large amount of scattering. 

Resonance maxima in the low-energy scattering like that found above 
for l = l can appeal* for any / value except l = 0, provided that the 
potential well is not quite dee]) or broad enough to contain a new energy 
level of that angular momentum (this corresponds in the case of the square 
well to having U small and positive). We can think of such a potential 
physically as containing a virtual energy level slightly above zero. While 
a discrete energy level cannot exist with positive energy, the positive 
“centrifugal potential” 1(1 + l)A 2 /2/xr 2 [see the discussion of Eq. (14.18)] 
for l > 0 acts as a potential barrier that impedes the escape of a particle 
that is in the virtual energy level. Figure 20 illustrates this barrier, 
which is characterized by a small transmission at low energies in the same 
way as is the barrier of Fig. 14 [see the discussion of Eq. (17.7)]. Thus 
the virtual level has a kind of transient existence and produces a greater 
distortion of the incident wave function at its energy than at neighboring 
energies. 



114 


QUANTUM MECHANICS 


[Chap. Y 


Angular Distribution at Low Energies. When the bombarding energy 
is small but not zero, the partial wave l = 1 may have an observable effect 
on the scattering. If only S 0 and 5i are appreciably different from zero, 
Eqs. (19.12) and (19.13) become 

a (^) = W 2 ^ n2 5 ° ® s ^ n 5o s ^ n Sl cos ~~ 5 o) cos 9 

+ 9 sin 2 5i cos 2 6] (19.32) 

<r = —\ (sin 2 8 0 + 3 sin 2 $i) 

In the absence of resonance, we see from Eqs. (19.27) and (19.32) that 
the ratio of the contributions to the total cross section of the partial waves 
l = 1 and Z = 0 is of order (£a) 4 . However, the ratio of the largest angle- 



Fig. 20. Effective potential energy [V(r) plus “centrifugal potential”] for l > 0, when 
V = 0 for r > a. The dashed portion for r < a depends on the shape of V. The effective 
potential barrier (r > a) resembles the barrier of Fig. 14 in that it has a small trans¬ 
mission for E slightly greater than zero,, 

dependent term in the differential cross section (which is proportional to 
cos 9) to the constant term is of order ( ka ) 2 . 

Thus the partial wave 1=1 manifests itself in the angular distribution 
at a lower energy than that at which it becomes significant in the total 
cross section; this is because of its interference with the stronger partial 
wave 1 = 0. For example, if So = 20° and = 2° at a particular bom¬ 
barding energy, the partial wave 1 = 1 contributes only 3 per cent to the 
total cross section while it makes the forward scattering (9 = 0°) 3.5 
times as great as the backward scattering (9 = 180°). 

20. SCATTERING BY A COULOMB FIELD 

It was noted in Sec. 19 that the Coulomb field is an exceptional scat- 
terer so far as the application of the method of partial waves is concerned. 
If V(r) = ZZ'e 2 /r for a collision between particles of charges Ze and 
Z'e, it is easily seen that Eq. (19.3) becomes asymptotically 

Xi(^) c^- l ^ T ~ n 


( 20 . 1 ) 
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Here n = nZZ'e 2 /h 2 k = ZZ'e 2 /hv, where v is the relative velocity and n is 
the reduced mass. Thus the radial solutions never approach the sinus¬ 
oidal free-particle solutions, since there is always a logarithmic contribu¬ 
tion to the phase at great distances that cannot be neglected. Although 
it is still possible to obtain a solution of this scattering problem in spheri¬ 
cal coordinates (this is given below), the phase shifts 81 introduced in 
Sec. 19 are altered in meaning. This section presents the analytical 
work in outline, and quotes the principal results from more extended 
treatments. 1 

Parabolic Coordinates. So long as <r(0) for a pure Coulomb field is all 
that is desired, it is simpler to work with the separation of the wave 
equation in parabolic coordinates (see Sec. 16) than in spherical coordi¬ 
nates. The reason for this is that the desired solution depends almost 
entirely on the variable £ defined in Eq. (16.25), and not on the other two 
variables rj and <j>. It is apparent that the solution will not involve cj>, 
because of the axial symmetry of the problem; if now the incident plane 
wave term e iks is taken out as a factor, it can be made plausible that the 
rest of the solution does not involve rj. We put 

tic = c ik *f (20.2) 

where u e represents the complete Coulomb wave function (incident plus 
scattered wave). Now u c must contain a part whose dominant asymp¬ 
totic behavior is of the form r“V /cr , but no part that goes like r~ l e~ ikr 
[see Eq. (18.10)]. Since an expression & k3 f(r — z) can be of this form, 
while an expression e ikz f(r + z) cannot, we anticipate that the function / 
appearing in (20.2) will depend only on £ = r — z. 

We substitute Eq. (20.2) into Eq. (16.26) after replacing Z by — ZZ', 
and remember that E > 0. The differential equation for / is then 

i ^ + (1 - - nkf = 0 (20.3) 

The confluent hypergeometric equation 
d 2 F (IF 

z ,w + {l, ~ z) ik~ aF = 0 < 20 - 4) 

which has the solution F(a t b f z) } is equivalent to Eq. (20.3) if we put 

ftt) = Ch\-in, 1,*£) (20.5) 

where C is a constant. 


1 W. Gordon, Zeiltt. J. Physik , 48, ISO (1028); N. F. Mott and H. S. W. Massey, 
“The Theory of Atomic Collisions,” ('hap. Til (Oxford, New York, 1933). For the 
mathematical background, see F. T. Whittaker and G. N. Watson, “A Course of 
Modern Analysis,” -1th ed., Chap. XVI (Cambridge, London, 1935). 
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Confluent Hypergeometric Function. The solution of Eq. (20.4) that 
is regular at z = 0 can be written as a power series 


F(a,b,z) = V _ r(ffl + s)r(6)z» 

k ; 4 rl# + s)T(i +7) 

. a = 0 


1 + 


az 

W\ 


I + l)g 2 . 

6(6 + 1)2! + ‘ 


( 20 . 6 ) 


It is convenient to put F(a,b,z) = W 1 (a,b,z) + W 2 (a,b,z), where W 1 and 
W 2 are separate y solutions of Eq. (20.4). An asymptotic expansion for 
t can then be obtained from the following relations: 


= r T 6-a) o - 6 + 1, -z) 

Wi(a,b,z) = - a,b- a,z) (20.7) 

g(«M —» l + ^ + Ml +1) + . . . 

2 ->oo z 1! z 2 2\ ^ 


The solution of Eq. (20.4) that is irregular at the origin can be taken to be 

G(a,b,z) = iW^z) - iW 2 (a,b,z) (20.8) 

We shall require the irregular solution for problems in which the Coulomb 
held does not extend in to r = 0. 

The asymptotic form of the Coulomb wave function can be obtained 
rom Eqs. (20.2), (20.5), and (20.7). The result through terms of order 

0"“* 1 -i ci 


IS 


Uc -► wtt~i—T- t { e W*-»in*(>•-*)] ("i_ n% 1 

T(1 +*») l L ik(r - z )J 

+ r-'f c (d)e i(kr - n,n -’*’•) 1 (20.9) 

where ' 

f (d) = £11 inH e~ f ” i ° fan 1 ^) 
fr(— in) 2k sin 2 £0 

= 2Fsin He h u ‘* lB)+ir+2im (20.10) 

Vo = arg r(l + in) 

., SC f^L Cr ? SS Section and Normalization. The/, term on the right 
side of Eq. (20.9) represents the outgoing scattered wave, since it is the 
°7 m term m whlch the factor r~ 1 e ikr appears. The first term in Eq. 
,. . s ™ llarly corresponds to the incident “plane” wave; the multi- 

p icative actor —n 2 /ik(r z) can be ignored in the asymptotic region. 
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Both the incident and scattered waves are distorted at infinite distances 
by logarithmic phase factors. In accordance with Eq. (18.11), the dif¬ 
ferential scattering cross section is 

- (8) - I'-w - (wiml 

“ (if?) ooaec ‘** (20.11) 

This is just the formula obtained by Rutherford from classical dynamics, 
and verified experimentally by him for the collisions of alpha particles 
(helium nuclei) with heavier nuclei. It should be noted, however, that 
the angle-dependent part of the phase factor in the scattered amplitude 
f c (6) given in Eq. (20.10) can manifest itself in a nonclassical way when 
the colliding particles are identical (see Sec. 32). 

If the incident beam is normalized to unit flux, the constant C must 
be chosen to be 

C = i»-*r(l + 


so that the Coulomb wave function is 

u c = v“*r(l + in)erl» r & k *F(— 

= tr *r(l + in)er t n *e ikr °™ 6 F( - in,\,2ikr sin 2 £0) (20.12) 

Then the particle density at r = 0 is found from the power series expan¬ 
sion (20.6) to be 


M0) | 2 = |C| 2 = ?> -1 |r(l + in) |V~ n7r 
_ 2 ht 


For small collision speeds (|n| » 1), Eq. (20.13) tells us that 

|//<■(()) | 2 attractive case, n < 0 

M0) | 2 ^ r 2n7r repulsive case, n > 0 


(20.13) 


(20.14) 


The second of Eqs. (20.14) is of some practical interest. The expo¬ 
nential is the dominant factor in the production of reactions between 
positively charged nuclei at low bombarding energies, when the nuclear 
radii may be assumed small enough so that the colliding nuclei have to 
approach to zero distance in order to initiate a reaction. In this case 
exp ( — 2TrZZ'e~/hv) is called the Gamow factor and is the dominant term 
in the rate of many nuclear reactions at low bombarding energies. 

1 G. Gamow, Zcits. /. Physik, 61, 204 (1928); It. W. Gurney and E. U. Condon, 
Phys. Rev., 33, 127 (1929). 
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Solution in Spherical Coordinates. In nuclear collision problems, 
such as that of the scattering of protons of several million electron-volts 
energy in hydrogen, the departures from the Coulomb law of interaction 
at small distances between the colliding particles can affect the scattering- 
cross section. Such problems can be treated by a modification of the 
method of partial waves, developed in Sec. 19, in which an expansion in 
spherical harmonics is made for the pure Coulomb field, and modifica¬ 
tions introduced for the first few l values. In order to apply such a 
technique, we require first a solution for the pure Coulomb scattering in 
spherical partial waves. 

W f e put 

oo 

u c = y Ri(r)Pi (cos 6 ) (20.15) 

1 = 0 

where the radial wave equation is 

1 d/ a dB,\ i 2 nk 1(1 + !) 

r 2 dr \ dr / |_ r r 2 

If we substitute Ri(r) •= r l & kT fi(r) } the equation for fi becomes 

r + \ 2ik r + 2(1 + 1)] + [2 ik(l + 1) - 2 nk-lfi = 0 (20.17) 

This is equivalent to the confluent hypergeometric equation (20.4), and 
has as its solution that is regular at r = 0 

fi(r) = CiF(l + 1 + in, 21 + 2, -2 ikr) (20.18) 

The asymptotic form of (20.18) can be found from (20.7), and gives for 
the radial wave function at great distances 

B,(r) (^rT+i'+inh- sin {kr ~ ~ n ln 2kr + m)> (20 ' 19) 

where r) L = arg T(l + 1 + in). 

The coefficients Ci must be determined so that the partial wave expan¬ 
sion (20.15) is identical with the solution (20.12) in parabolic coordinates. 
Because of the orthogonality of the Legendre polynomials, we have the 
relation 

Ri(r) = J’p t (cos 6)u e (r,6) sin QdB (20.20) 

where u c (r,d) is given by the second of Eqs. (20.12). The complete 
evaluation of this integral can be avoided by making use of the fact that 
we know all about the function Ri(r) except the constant multiplying 


Ri - 0 (20.16) 
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factor Ci. Ci can then be found by matching the known form of Ri(r) 
to Eq. (20.20) near r = 0, and turns out to be 

n _ (2ik) l e~^ nr T(l + 1 + in) 

Ll id (2 1)\ 

We thus obtain as an alternative expression to (20.12) 

eo 

u c = V V(l M (2ikr) l e ikr 

L=0 

• F (l + 1 + in, 21 + 2, — 2ikr)Pi(cos 0). (20.21) 

Modified Coulomb Field. If the actual potential deviates from the 
Coulomb form only at small values of r, we expect in analogy with the 
partial wave treatment of section 19 that only the first few terms in 
the sum (20.21) will be altered. Since each partial radial wave function 
must be a solution of (20.16) outside of the potential anomaly, the only 
change we can make in the function ft and still have it a solution of (20.17) 
is to add in some of the irregular solution G(l + 1 + in , 21 +- 2, —2 ikr) 
defined by (20.8). The manner in which G is to be added in is determined 
by the requirement that the complete wave function shall represent 
asymptotically a Coulomb incident plus scattered wave, plus an extra 
outgoing scattered wave. 

We must, therefore, substitute for each F term in (20.21) a linear com¬ 
bination of F and G in which the amount of the ingoing term W 2 is not 
changed. Such a combination is 

e i6 i(F cos 5i + G sin 8i) = W i.c 2 ' 6 * + W 2 . 

The modified wave function, which is a solution of the wave equation 
outside of the potential anomaly, can then be written 

oo 

u m = u c + v-ie-i’" £ - r( t±A±^ (2ikr) l c ikr 

. — 1) W\{l ”b 1 “b in } 21 -f- 2, —2'fAr)/ > z(eos d) (20.22) 

The asymptotic form of u, n is 

u m -> v J / (2/ + 1)/V' ( ^ 5 />(At) _1 

r-> * 

l — o 

■ sin (hr — I-Itt — n In 2 hr + rn + 8i)Pi (cos 0) (20.23) 
As shown in connection with Eq. (19.6), each term on the right side of 
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(20.23) must be a real function of r, except for complex multiplying fac¬ 
tors, so that the 61 must be real. 

The additional phase shifts can be found by matching each partial 
radial wave in Eq. (20.22) to the interior solution at the edge of the 
potential anomaly, in just the same way as the phase shifts were found 
in Sec. 19. While in Sec. 19 the phase shifts 5 Z represented the departure 
of the wave function from that of a free particle, they here represent 
the departure from the wave function of a particle scattered by a pure 
Coulomb field. 1 It can be shown from Eq. (20.22) that the asymptotic 
form of u m may be written in the form of (20.9), where / c (0) is replaced by 


f m (6) = f c (6) + 7 k~ l {2l + sin 5 z P z (cos 6) (20.24) 

1 = 0 

The differential scattering cross section is just |/ m (0)| 2 , and in general con¬ 
tains interference terms between the Coulomb scattered amplitude / c (0) 
and the extra terms that are determined by the 5 Z . 

Classical Limit for a Pure Coulomb Field. As was discussed in Sec. 
12, we expect the results of quantum and classical theory to coincide 
whenever it is possible to construct wave packets that follow the classical 
trajectory without spreading appreciably and are small enough so that 
the forces are sensibly constant over their dimensions. A wave packet 
cannot be made much smaller than the reduced wave length 

X == — = — 

2 tt fiv 


of the relative motion if it is not to spread apart at once. For a repulsive 
Coulomb field, the distance over which the force varies by an appreciable 
fraction of itself may be taken to be of the order of the classical distance 
of closest approach. This also provides a useful estimate for an attractive 
field, since in all the collisions except the relatively few for which the 
particles are scattered through large angles, they never get closer than 
this distance from each other. 

Thus the classical limit corresponds to making X small in comparison 
with \ZZ f e 1 or 


n = 


zzv 

hv 


» 1 


Large n implies that the angle-dependent part of the phase of / c (0) given 
in (20.10) varies rapidly with 0, so that these rapid oscillations in the 

1 The computation of the 8i in Eq. (20.22) requires knowledge of G at small r; 
useful formulas have been given by F. L. Yost, J. A. Wheeler, and G. Breit, Phys. Rev ., 
49 , 174 (1936). 
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scattering amplitude should have little effect on the scattering when the 
colliding particles are identical (see Prob. 6, Chap. IX). 

It is interesting to note that for the Coulomb field the classical limit is 
approached for small v, whereas for potentials that have a finite range a, 
such as are discussed in Sec. 19, the classical limit is approached for 
\<&a, i.e., for large v. This is because the “size” |ZZ'e 2 //^ 2 | of the 
Coulomb field increases more rapidly than \ = h/y .y as y decreases. 

Problems 

1. Show that the coefficients of scattering by a one-dimensional square well poten¬ 
tial (like Fig. 14 except that V Q < 0) are given by Eqs. (17.5) if the sign of V Q is 
changed there and in the expression for a. Discuss the dependence of transmission 
coefficient on E in this case. 

2. Show that Eqs. (18.4) and (18.7) are valid for a general binary collision if 
V is given by (18.5); make use of conservation of energy and mass. 

3. Show that when a particle of mass mi collides elastically with a particle of mass 
m 2 that is initially at rest, all the recoil (mass ma) particles are scattered in the forward 
hemisphere in the laboratory coordinate system. If the angular distribution is 
spherically symmetrical in the center-of-mass system, what is it for m 2 in the labora¬ 
tory system? 

4 . What must Voa 2 be for a three-dimensional square well potential in order that 
the scattering cross section be zero at zero bombarding energy (Ramsauer-Townsend 
effect) ? Find the leading term in the expression for the total cross section for small 
bombarding energy. 

5. State clearly the assumptions that go into the derivation of Eq. (19.31), and 
verify that it is a suitable approximation for the total cross section at low bombarding 
energies when the l = 0 wave is in resonance. 

6. Make use of Eq. (19.31) and the result of Prob. 5, Chap. IV, to obtain an 
approximate expression for the total scattering cross section by a particular potential 
in terms of the bombarding energy E and the binding energy e of a particle in that 
potential, when E and e are small in comparison with Vo. 

7. Compute and make a polar plot of the differential scattering cross section for a 
perfectly rigid sphere when kn = -J, using the first three partial waves (L — 0, 1, 2). 
What is the total cross section in this case, and what is the approximate accuracy of 
this result when the three terms arc used? 

8. Find a general expression for the phase shift produced by a scattering potential 
F(r) = A/r 2 , where A > 0. Is the total cross section finite? If not, does the diver¬ 
gence conn; from small or large scattering angles, and why? What modifications are 
necessary in the calculation if A < 0? Are any difficulties encountered in this latter 
case? 

9. Protons of 200,000 electron-volts energy are scattered from aluminum. The 
directly back scattered intensity (0 = 180°) is found to be 96 per cent of that com¬ 
puted from the Rutherford formula. Assume this to be due to a modification, of the 
Coulomb potential that is of sufficiently short range so that only the phase shift for 
l — 0 is affected. Is this modification attractive or repulsive? Find the sign and 
magnitude of the change in the phase shift for l = 0 produced by the modification. 



CHAPTER VI 

MATRIX FORMULATION OF QUANTUM MECHANICS 

In the last four chapters the Schrodinger wave equation was developed 
and its solutions obtained in some cases of physical interest. We now 
turn to a different formulation of quantum mechanics, in which dynamical 
variables such as the coordinates, momentum components, and energy 
of a particle appear explicitly in the equations of motion of the system 
without their having to multiply or differentiate a wave function. The 
classical equations are just of this structure, so it might be expected that 
there would be a closer resemblance between the classical and quantum 
formalism here than in the Schrodinger theory. 

This is actually the case; the principal formal difference is that the 
quantum dynamical variables do not obey the commutative law of 
multiplication. It is convenient to represent such noncommutative 
dynamical variables, which are often simply called operators , as matrices. 
Matrix theory provides an expecially flexible representation scheme, since 
there are an arbitrarily large number of ways of choosing the rows and 
columns of a matrix, all of which are of equal validity. It is because of 
the close formal resemblance between quantum matrix mechanics and 
classical dynamics that this was historically the first formulation of 
quantum theory to be discovered, by Heisenberg in 1925. 1 

In this chapter we first review briefly the more important properties 
of matrices, and then show their connection with quantum theory and 
their usefulness in particular problems. 

21. MATRIX ALGEBRA 

We restrict our discussion at first to matrices that have a finite num¬ 
ber of rows and columns, and then indicate how the results obtained can 
be taken over to matrices that have an infinite number of rows and 
columns. 2 

1 W. Heisenberg, Zeits. f. Physik , 33, 879 (1925); M. Born, W. Heisenberg, and 
P. Jordan, Zeits . /. Phys. 35, 557 (1925). The connection between quantum matrix 
mechanics and the wave equation was established by E. Schrodinger, Ann. d. Physik, 
79, 734 (1926), and C. Eckart, Phys. Rev., 28, 711 (1926). 

2 For a fuller discussion, see J. von Neumann, “ Mathematische Grundlagen der 
Quantenmechanik,” Chap. II (Springer, Berlin, 1932; reprinted by Dover, New 
York). 
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Matrix Addition and Multiplication. A matrix is a square or rec¬ 
tangular array of numbers that can be added to or multiplied into another 
matrix according to certain rules. We denote a matrix by a capital 
letter, such as A, and the numbers or elements that make it up by the same 
letter with subscripts, such as A U ) here, k designates the row and l the 
column in which the matrix element A k i appears. Two matrices can be 
added when they have the same rank , i.e., the same number of rows and 
the same number of columns. Addition is commutative: 

A + B = B + A (21.1) 

If the sum matrix is called C , then 

Cm = Am + Bm ( 21 . 2 ) 

A matrix A can be multiplied from the left into a matrix B if the num¬ 
ber of columns of A is equal to the number of rows of B \ then the product 
matrix C has the number of rows of A and the number of columns of B. 

C = AB, Cu = ^ A km B ml (21.3) 

m 

where the summation is over the subscript m, which denotes the columns 
of A and the rows of B. Tt follows at once from Eqs. (21.2) and (21.3) 
that the distributive law of multiplication is valid. 

A(B + C) = AB + AC (21.4) 

Also, the associative law of multiplication is seen to be valid: 

A(BC) = ( AB)C (21.5) 

where the left side means that A is multiplied from the left into the 
product of B and C } and the right side means that the product of A and B 
is multiplied from the left into C. The product (21.5) is written simply 
as ABC and, from (21.3), has the explicit expression 

D = ABC, D u = ^ A ^B uin Cm (21.6) 

m,n 

It is clear from Eq. (21.3) that AB is not in general equal to BA ; thus 
the commutative law of multiplication is not generally valid. 

Null, Unit, and Constant Matrices. For an arbitrary square matrix 
A, the null matrix O is defined by the equations 

OA = O, AO = O, (21.7) 

from which it follows that all the elements of O are zero. If A is not 
square, the elements of O are still all zero, but the O’s that appear at 
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different places in (21.7) do not all have the same numbers of rows and 
columns. 

The unit matrix 1 is defined by 

1A = A, B1 = B, (21.8) 

for arbitrary matrices A and B. From Eqs. (21.8) it follows that 1 is a 
square matrix whose rank (number of rows or columns) equals the num¬ 
ber of rows of A or the number of columns of B. Moreover, 1 must have 
unit elements along its principal diagonal (k = l ) and zeros elsewhere, so 
that the elements of 1 equal the Kronecker symbol hi introduced in 
Sec. 10. % 

The product of a number c and a matrix A is the matrix cA that results 
from multiplying each element of A by c. Thus if we define a constant 
matrix C to be a multiple of a unit matrix so that each nonvanishing 
element is c instead of unity, then 

cA = CA, where Cm — c8 k i (21.9) 

are the matrix elements of the constant matrix C. 

Spur, Determinant, and Inverse of a Matrix. The spur of a square 
matrix, often called the trace or the diagonal sum , is the sum of the 
diagonal elements of the matrix: 

SpU) = 2 A kk (21.10) 

k 

The determinant of a square matrix is found from the usual rule for the 
computation of the determinant of a square array of numbers. 

A matrix A may or may not possess an inverse A"" 1 , which is defined 
by the relations 

AA“* = 1, A-'A = 1 (21.11) 

A is said to be nonsingular if it possesses an inverse, and singular if it does 
not. If A is square, the kl element of its inverse is just the cofactor of 
Aik divided by the determinant of A; in this case A is singular if its 
determinant vanishes. It is readily verified that for nonsingular matrices 
A, B, C 

(ABC)- 1 = C-^A- 1 (21.12) 

Hermitian and Unitary Matrices. The Hermitian adjoint A* of a 
matrix A is the matrix obtained by interchanging rows and columns and 
taking the complex conjugate of each element; thus if 


£ = A* then B kl = A lh 


(21.13) 
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It is readily verified that the Hermitian adjoint of the product of a series 
of matrices is the product of their adjoints in the reverse order 

(ABC)* = C*B*A* (21.14) 

A matrix is Hermitian or self-adjoint , if it is equal to its Hermitian 
adjoint; thus, A is a Hermitian matrix if 

A = A* (21.15) 

Evidently only square matrices can be Hermitian. 

A matrix is unitary if its Hermitian adjoint is equal to its inverse; 
thus A is a unitary matrix if 

A* = A~\ or AA* = 1 and A* A = 1 (21.16) 

Unitary matrices need not be square. 

Transformation and Diagonalization of Matrices. We define the 
transformation of a square matrix A into a matrix A' by a nonsingular 
matrix S, by the following equation: 

SAB - 1 = A' (21.17) 

It is evident then that S~~ l transforms A' back into A. 

The form of a matrix equation is unaffected by transformation. Thus 
the equation 

AB + CDE=F 

may be transformed into 

SA ftSM + SCDES - 1 = SFS- 1 
which is equivalent t o 

SAB- 1 • SBS~' + SC8~ l * 8 DS- 1 ■ SES~ l = SFS- 1 

or to 

A 'B' + C'D'FA = F r 

where the primes denote transformed matrices. This invariance of 
matrix equations with respect to transformations makes it possible to 
work with any convenient transformation of a set of matrices without 
affecting the validity of any results obtained. 

A square matrix is diagonal if it has nonvanishing elements only along 
the principal diagonal (k = 1). The diagonal elements are then called 
the eigenvalues of the matrix. It is easily seen that the wth power of a 
diagonal matrix is also diagonal, and has as its eigenvalues the nth 
powers of the eigenvalues of the original matrix. The matrix A in Eq. 
(21,17) is said to be diagonalized, by the matrix S if the matrix A f that 
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results from the transformation is diagonal, so that A kl = A k 8 k i. To 
find A' explicitly, we multiply (21.17) through on the right by S. 

SA = A'S (21.18) 

The set of linear algebraic equations that are obtained by writing out 
the elements of Eq. (21.18) for a particular row k and all columns l is 

^ SkmA m i = A'jJSm or ^ Skm(A m i — A k 8 m i) = 0 (21.19) 

m m 


where A k is a particular eigenvalue of A ' and the subscript m is summed 
over from unity to the rank N of the matrix A. 

Now (21.19) may be regarded as a set of N homogeneous algebraic 
equations for the transformation matrix elements & w , where k is fixed. 
The necessary and sufficient condition that these equations have a solu¬ 
tion is that the determinant of their coefficients vanish, or that the 
determinant of the square matrix (. A mi — A k 8 m i) be zero. This provides 
a single algebraic equation, called the secular equation , which is of order 
N and has N roots A k . Thus the eigenvalues of the diagonal matrix A' 
resulting from A by transformation are the same no matter how A is 
diagonalized, except perhaps for the order in which they are arranged; 
for this reason they are also called the eigenvalues of the original nondiag¬ 
onal matrix A. 

Matrices of Infinite Rank. The rules (21.2) and (21.3) for addition 
and multiplication of matrices may be taken over in an obvious way for 
matrices that have an infinite number of rows and columns, provided 
that the infinite sum in (21.3) converges. We sometimes deal with 
matrices that have a nondenumerably infinite number of rows or columns 
or both; in this case, one or both of the matrix subscripts becomes 
a continuous variable, and the usual summation over a subscript is 
replaced by integration over this variable. We do not consider these 
possibilities in detail here, but simply assume that all reasonable results 
can be taken over from finite- to infinite-rank matrices without difficulty. 1 
The statement that a Hermitian matrix of infinite rank is square means 
that its rows and columns are labeled in the same way. A unitary 
matrix of infinite rank need not be square. Its rows and columns can 
be labeled differently; for example, the number of rows may be denumer- 
ably infinite and the number of columns nondenumerably infinite. 

We are concerned primarily in quantum mechanics with Hermitian 
and unitary matrices, mainly of infinite rank. A fundamental theorem 


1 A more thorough discussion of this point, and a proof of the following theorem, 
are given by J. von Neumann, loc. cit. 
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that we shall assume without proof is that any Hermitian matrix can be 
diagonalized by a unitary transformation; a corollary of this theorem is 
that the resulting eigenvalues of the Hermitian matrix are unique, 
except perhaps for the order in which they are arranged. Starting from 
this theorem, it is not difficult to show (see Prob. 1) that the necessary 
and sufficient condition that two Hermitian matrices can be diagonalized 
by the same unitary transformation is that they commute (matrices A 
and B commute if AB = BA). 

It also follows from this theorem that the eigenvalues of a Hermitian 
matrix are real. If the S and A in (21.17) are unitary and Hermitian, 
respectively, that equation can be rewritten 

SAS* = A' (21.20) 

The Hermitian adjoint of Eq. (21.20) is, from (21.14), 

SAS* = A'* 

Since then A'* = A ', this shows that the Hermitian property is 
maintained during transformation by a unitary matrix. If A' is diagonal 
as well as Hermitian, it follows from (21.13) that its eigenvalues are real. 
It is easily seen that the converse is also true: a matrix that has real 
eigenvalues is Hermitian. 

It is important to note with matrices of infinite rank that both of Eqs. 
(21.11) must be valid in order for A~ l to be the inverse of A. Similarly, 
both of the latter pair of Eqs. (21.16) must be valid if A is to be unitary. 

22. MATRICES IN QUANTUM MECHANICS 

The appearance of matrices in quantum mechanics can be connected 
in a simple way with the solution of the Schrodinger equation (8.2). In 
this section we adopt the Hamiltonian notation, and justify it in detail 
in Sec. 23. We rewrite Eq. (8.2) as 

Hu k { r) = E,Mr) (22.1) 

where the subscript k denotes the different members of the complete 
orthonormal set of energy eigenfunctions U/ C (r) and their corresponding 
eigenvalues E k [k specifies the energy, and also distinguishes between 
degenerate eigenfunctions; it thus includes both E and 5 of Eq. (10.7)]. 
The Hamiltonian or energy operator II is given by 

" - S + rv - - s v ’ + ™ < 22 - 2 > 

In accordance with the discussion of Set;. 8, k may be a discrete or a 
continuous variable, or discrete over part of its range and continuous 
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over the rest. We shall use the symbol $ or to denote a complete 
summation over the subscript k (the summation includes an integration 
fdk over the continuous part of its range). 

A Unitary Transformation Matrix. Suppose that we have a second 
complete orthonormal set of functions v w (r), which are not necessarily 
eigenfunctions of the Schrodinger equation (22.1) with the actual poten¬ 
tial energy F(r); they might, for example, be momentum eigenfunctions 
(11.4) or (11.11), or hydrogen-atom wave functions (16.24) supplemented 
by continuous-eigenvalue Coulomb-field wave functions like those dis¬ 
cussed in Sec. 20. The v n can be expanded in terms of the u k : 

(r) (r) 

where it is readily verified from the orthonormality of the u k that 

Shn = Jtf*(r)tf n (r)dT 

Similarly 

Uh( r) = SnSknVnir) (22.3) 

We can now see that the matrix of which S kn are the elements is 
unitary. 

(SS*) k l = S nSknBln = S*J^( r X( r )^ r jv n (j')v,i{r')dT' 

= JK(r)^(r')5(r - r')drdr' = JiJ*( r )u,(r)dr (22.4) 

where use has been made of the closure property that is possessed by any 
complete orthonormal set of functions such as the v n (r) [see the discussion 
of Eq. (10.11)]. The last integral on the right side of (22.4) is a Kronecker 
8 symbol or a Dirac 8 function according as k is one of a discrete or a con¬ 
tinuous set of subscripts; in either case, this integral is equivalent to an 
element of the unit matrix, so that we have shown that SS* = 1. In 
similar fashion, it can be established that 

(S*S) nm = S k S kn S km = (1) ntn 

Hamiltonian Matrix. The functions r n (r) can be used to calculate a 
Hamiltonian matrix 

Hum = fv n (r)Hv m (r)dT (22.5) 

where H is the operator given in (22.2). We now consider the connection 
between the Hamiltonian matrix (22.5) in the v n representation , and the 
energy eigenvalues E k . 

Transformation of H nm by the unitary matrix S gives 
(SHS*) kl = S n . m S kn H nm Si m 
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where the prime on H indicates that it operates only on the variable r' 
to its right. After performing the summation over the subscript m to 
get 5(r' — r"), it can be shown by means of two partial integrations that 

Jff'8(r' - i n )u l {x ,r )dr n = jui(r' f )H"8(r f - r")dr" 

= J6(r' - r /, )H"u 1 (i")dr f/ 

= H'ui( r') 


On doing the sum over n and dropping the primes, we obtain 

(SHS*U = SM*)Hui(r)dr = E k b kl or E k 8(k - l) (22.6) 

which is in diagonal form with the eigenvalues E k . 

Thus the problem of solving the Schrodinger equation is completely 
equivalent to the problem of diagonalizing the Hamiltonian matrix when 
it is expressed in some arbitrary representation such as that provided by 
the functions v n . The eigenvalues of the Hamiltonian matrix are the 
energy eigenvalues of the Schrodinger equation, and the unitary trans¬ 
formation matrix S that diagonalizes Ii serves, through Eq. (22.3), 
to give the energy eigenfunctions u k { r) in terms of the arbitrary initial set 
of functions. 

It is interesting to note that S is not necessarily a square matrix. For 
example, the functions v n may be the eigenfunctions of a three-dimensional 
harmonic oscillator, which form a completely discrete set, while the func¬ 
tions u k may be momentum eigenfunctions, which form a completely 
continuous set. However the Hamiltonian matrix, in both its diagonal 
form (22.6) and its nondiagonal form H nm , is square. 

Dynamical Variables as Hermitian Matrices. The eigenvalues of the 
Hamiltonian are found in (22.6) to be the real energy levels E k . It then 
follows from Sec. 21 that H is a Hermitian matrix in any representation. 

The discussion of Sec. 10 developed the interpretation that the eigen¬ 
values of any operator that represents a dynamical variable are real, since 
they are the only possible results of precise physical measurement of that 
variable. Any dynamical variable can be represented as a matrix whose 
diagonal representation has its eigenvalues along the principal diagonal, 
and that can be transformed to any other representation by a suitable 
unitary matrix. Thus any physically measurable dynamical variable 
can be represented by a Hermitian matrix, and is said to be Hermitian. 

Wave Functions as Unitary Matrices. Any complete orthonormal set 
of functions, such as the u k (r) or the v n (r), possesses the closure property 


= S( r — r') 


(22.7) 
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and the orthonormality property 

fu k (t)ui(x)dT = S k i or 8(k — l ) (22.8) 

If now we regard the function u k { r) as a two-dimensional array of 
numbers in which the rows are labeled by the position variable r and the 
columns by the subscript k , it is equivalent to a matrix U T k- Equation 
(22.7) is then equivalent to the matrix equation 

(uu*) = s k u*u,„ = (i)„ ( 

Similarly, Eq. (22.8) is equivalent to 

{U*U) U = Sr U tk U tl = (1)„ 

Thus U is a unitary matrix. 

The r Representation. This suggests that such a unitary matrix be 
used to effect a transformation. Suppose that we transform the Hamil¬ 
tonian matrix H nm given in (22.5) by means of the unitary matrix 

V T n = V n (r). 

(VHV*) IT ' = Sn,?nV in H nm V r r m = Sn,mVn(r)H um V m (r') 

= Sn.mV n (t)Sv n (l")H"V m (r")dT'' ■ V m (l') 

= fS(r - T')H"8(r" - r')dr" = HS( r - r') (22.9) 

The last step makes use of partial integrations and implies that H operates 
only on the variable r in <5(r — r'). • 

The result embodied in (22.9) is the inverse of the definition (22.5). 
The latter takes a differential operator H and constructs from it a matrix 
representation, while the former transforms that matrix back into what is 
effectively the differential operator. However, we now see what was not 
so obvious from the form (22.2) for H : that a differential operator on the 
space coordinates can be expressed as a matrix in a representation in 
which the rows and columns are labeled by the position variables r and r'. 
From this point of view, the solution of the Schrodinger differential wave 
equation (22.1) is equivalent to the diagonalization of the matrix 

H Ix r = H8( r - r') 

just as we saw above that it is equivalent to the diagonalization of the 
matrix H nm- 

The r representation is one in which the coordinate r is diagonal: 
(r)r'r" = r'5(r' — r"). It is worth pointing out explicitly that the Hamil¬ 
tonian H n / in the r representation is not diagonal, even though the 8 
function makes it vanish when r differs from r' by a finite amount. This 
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is because derivatives of the 8 function appear, and these have nonvanish¬ 
ing matrix elements infinitesimally removed from the diagonal r = r'. 
For example, the matrix /(r) 8 (r — r') is diagonal, but the matrix 

(i d/dx ) 8(r — r') 

and the matrix V 2 5(r — r') are not. 

A Useful Identity. If Q is an operator whose effect on a function /(r) 
can be represented as 

m) = JO(r,r')/(r0dr' 

then we can establish an identity that is sometimes useful: 

smmr)]dr = M/(r)ir (22.10) 

If we regard 12(r,r') as a matrix in the r representation, the Hermitian 
adjoint of Q operating on g( r) is 

n*g(. r) = 

from which Eq. (22.10) follows at once. 

This identity enables us to transfer operators from one factor of an 
integrand to another. The partial integrations in Eqs. (7.9), (7.10), 
and (12.3) are examples of the application of (22.10). In those cases, 
Q is a differential operator, and its matrix representation involves deriva¬ 
tives and multiples of the 8 function. However, Q, need not be of this 
specialized type [it might, for example, be an integral operator like the 
square bracket in Eq. (10.19)], and (22.10) is valid as well for operators 
that have non vanishing matrix elements a finite distance from the 
diagonal. 

23. EQUATIONS OF MOTION IN MATRIX FORM 

In the preceding section we discussed the principal transformation 
properties of matrices that represent dynamical variables at a particular 
instant of time. We now find equations of motion for these variables by 
computing the time rate of change of their matrix representations. The 
resulting equations are very similar in form to the classical equations of 
motion and indicate a general procedure for quantizing any classical 
system. 

Time Rate of Change of a Matrix. We take as our starting point the 
time-dependent Schrodinger wave equation (6.16), expressed in terms of a 
Hamiltonian operator, 

ih ~ 


(23.1) 
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where a typical H is that given by Eq. (22.2). A matrix representation 
for any function F of the dynamical variables can be found in terms of a 
complete orthonormal set of functions, each of which depends on the 
time m accordance with (23.1). We call any two of these functions * 
and and calculate the time rate of change of a typical matrix element 

/ 4>Fif/dr = // 4(U)F(r,r'M{r',t)dTdT' 

It is assumed that Fisa general operator (not necessarily a differential or 
multiplicative operator) of the type considered in Eq. (22.10), which mav 

depend explicitly on the time. y 

Differentiation with respect to t gives 


It I Wr *(r,0 [|F(r,r',o] 
+ J J <Kr,0F(r,r',i 


Hi'Mrdr' 


dt 


rV) 


drdr' 


+ 


// ^m] F(r,r'M(t',t)dTdT' 
= j 4 >~Wt + 1 J! ^(r,0F(r,r',<)[//V(r',0]drdr' 

~e/J [^( r .<)]f ? (r,r , ^)^(r , ,Odrdr' 

dFlTt ^suhst^t 011 A? Side iS the matrix element of th e operator 
, it ’ substltutlon h as been made from Eq. (23.1) We make use 

FVfnthetsft^ 1 ^ d° “ * ^isWrmitiaT ^ 

m the las t term, and obtain the equation 


It j Wt = / 


- dF 1 

*n* d,+ k 


KFH - HF)tdr 


funo T t 1 h oLi 1 L b !.''" i “ e °. aS ? e, "“ ion fot raatl “ ‘he 

functions f and <f> are quite arbitrary at any instant of time: 

dF dF 1 

dt ~ It + ih ( FH ~ HF ) (23.2) 

Jhanee^of tL° f ^ Wh ° Se elements are the time rate of 

derivativlomr ; X !T nt L° f F> and may be called the ^tal time 

of the rJrtinldl ^ firSt term ° n the right side is the matrix 

eL w d d , enYati :i oi F Wlth ^Peet to t, and takes into account the 
explicit dependence of F on the time. The last term is that part of the 

time derivative of the F matrix that arises from the change in time of 
the functions with respect to which the matrix is calculated Equation 
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(23.2) is Heisenberg’s form of the equation of motion of a dynamical 
variable. 

Classical Lagrangian and Hamiltonian Equations of Motion. In 

order to bring out the similarity between Eq. (23.2) and the corresponding 
classical equation, we review briefly the structure of classical Hamiltonian 
theory. The equations of motion of a conservative dynamical system 
that has / degrees of freedom may be derived from a Lagrangian function 
L(qi } . • • , qfj 4h • • • , if, t) of the coordinates q { , the velocities 
qi = dqi/dt , and the time, by means of a variational principle: 1 


8 J ( l Ldt = 0, 8qi(ti) = 8qi(h) = 0 
The resulting Lagrangian equations are 



(23.3) 


(23.4) 


If now we define a momentum canonically conjugate to qi as Pi = dL/dqi , 
and a Hamiltonian function of the coordinates and momenta as 


/ 

H(qi ■ ■ ■ q f ,V i ’ • • = £ V4i ~ L 


variation of H leads to the Hamiltonian equations of motion 

dH 


dH 

Qi 

opi 


Vi = " % 


i 


• / 


(23.5) 

(23.6) 


The time dependence of any function of the coordinates, momenta, and 
the time, calculated along a moving phase point, is 



* (If, Pi • 



dF V (SF dH _ dH dF\ 
dt 4 \dgt dpi dqi dpj 

i = 1 


on making use of the Hamiltonian equations (23.6). The Poisson bracket 
{ A,B\ of any two functions of the coordinates and momenta is defined as 


\A,B\ 


f 



8A dB 
dq f dpi 


i — 1 


dB 

dqi dpi) 


(23.7) 


1 E. T. Whittaker, “Analytical Dynamics,” 3d ed., Hecs. 99, 109 (Cambridge, 
London, 1927). 
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In terms of the Poisson bracket, the equation of motion for the function 
F of the dynamical variables becomes 

§ - %+^ 

The left side of Eq. (23.8) is the total time derivative of F along a 
moving phase point, the first term on the right side takes into account the 
explicit time dependence of F, and the last term shows the change in F 
due to the motion of the phase point at which F is evaluated. Thus there 
is a strong resemblance between Eqs. (23.2) and (23.8); the effect of 

the moving phase point in the latter corresponds to the effect in the 

former of the change with time of the functions that specify the matrix 
representation. 

Poisson Brackets and Commutator Brackets. The resemblance 

between Eqs. (23.2) and (23.8) suggests that the quantum analogues of 
the classical equations of motion be found in general by substituting the 
commutator bracket divided by ih for the Poisson bracket: 

{A,B} - i [A,B] = I (AB — BA) (23.9) 

There are two observations that lend support to this suggestion. The 
first concerns the classical conditions for a contact transformation from 
one set of canonical variables q i} pi to another Qi,Pi\ l 

{Qi,PA = «#, IQiyQA = o, = o (23.10) 

where the Poisson brackets are calculated with respect to the original 
variables q^pi. Now we saw in Sec. 6 that a successful transition from 
classical to quantum theory could be made by substituting the differential 
operator —ih(d/dx) for p x , etc. The commutator of x and p x can then be 
found by letting it operate on an arbitrary function gr(r) of the coordinates. 

(xp x - Pxx)g(i) = — (xg) = ihg( r) (23.11) 

Since g(r) is arbitrary, this and the other commutators may be written 
as operator equations 

d_ 
dx 

A _ A 

dy by 

xy - yx = 0, p x p y - p y p x = 0, etc, 

1 Whittaker, op. cit., pp. 300, 307, 


xp y 


— PyX = —ih (x 



xp : 


- Pl x = -a(x£ 
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These are in agreement with the classical equations (23.10) when the 
substitution (23.9) is made. 

The second observation is that the algebraic properties of the com¬ 
mutator brackets are identical with those of the Poisson brackets. It is 
readily verified from the definition (23.7) that 

{A,B} = — {B,A}, {A,e} =0, where c is a number 

{(Ax + A 2 ), B) = [AijB] + {A 2 ,B\ (23.12) 

{AiAo,B} = {A h B}A 2 + A 1 {A,,B} 

{A,{B,C}} + {B,{C,A} \ + {C,{A,B}} =0 
The order of possibly noncommuting factors has not been altered. 
Dirac 1 has shown that the form of the quantum analogue of the Poisson 
bracket is determined by Eqs. (23.12) to be the right side of (23.9); the 
constant h is, of course, arbitrary so far as this discussion is concerned 
(see also Prob. 10). 

Quantization of a Classical System. It is plausible on the basis of the 
preceding discussion to assume that any classical system can be taken 
over into quantum mechanics by finding the classical Iiamiltonian func¬ 
tion and equations of motion in terms of some set of canonical variables 
qi,Pi, and replacing the Poisson brackets in (23.8) and (23.10) by com¬ 
mutator brackets in accordance with (23.9). The canonical variables are 
then found to obey the quantum conditions 

ki,Pj] = ifiSij, [q u qi] = 0, [p,pi] = 0 (23.13) 

This quantization technique will be found useful in connection with 
classical wave fields (Chap. XIII) as well as classical particles. 

Two precautions are found necessary in applying this technique. 
First, the coordinates and momenta must be expressed in cartesian coordi¬ 
nates. And second, ambiguities in the order of noncommuting factors 
are usually resolved by taking a symmetric average of the various possible 
orders. These precautions are illustrated in the following example. 

Motion of a Particle in an Electromagnetic Field. As an example of 
the foregoing quantization technique, we consider the problem of the 
motion of a charged mass point in a general external electromagnetic 
field. The classical Hamiltonian, expressed in terms of the canonical 
variables r, p and the electromagnetic potentials A(r,£), </>( r »0> is 2 



(23.14) 


1 P. A. M. Dirac*,, “The Principles of Quantum Mechanics,” 2ded., Sec. 25 (Oxford, 
New York, 1935). 

2 J. H. Van Vleck, “The Theory of Electric and Magnetic Susceptibilities ,” pp. 
7, 20 (Oxford, New York, 1932). Gaussian units are used throughout the present 
book. 
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where e is the charge on the particle and c is the speed of light; the electric 
and magnetic field strengths are given in terms of the potentials by 

E = ~ ~ ~ grad 4>, H = curl A (23.15) 

c ot 

The quantum conditions (23.13) in cartesian coordinates are 

fopj = [y,Vv\ = fopj = ih (23.16) 

with other pairs of coordinate and momentum components commuting. 
We now use Eq. (23.2), with the expression (23.14) for H and the relations 
(23.16), to calculate expressions for the particle velocity dx/dt and 
acceleration dh/dt 2 , for comparison with the corresponding classical 
expressions. 

Evaluation of Commutator Brackets. In order to facilitate evalua¬ 
tion of some of the commutator brackets that arise from substitution 
into (23.2), we derive a few elementary results. Any two functions of r 
commute with each other, since all components of r commute with each 
other. It follows from (23.16) that 

x 2 p x — Px% 2 = x{p^ + ih) — PxX 2 

= (PxX + ih)x + ihx — PxX 2 = 2 ihx 

It is readily shown by induction that 

x n p x — p x x n = nihx n ~ 1 (23.17) 

It follows from (23.17) that for any function /(r) that can be expressed 
as a power series in x,y } z, the relation 

[/(r),pj = /( t)Vx - Pxf( r) = ih (23.18) 

is valid. 1 Equation (23.18) can also be established for more general 
functions than power series by making use of the representation of p % 
as — ih{d/Bx) ) as in (23.11); if we operate with the left side of (23.18) 
on an arbitrary function we obtain 

[/(r),p*Mr) = -ih f(r)A - A-/( r ) g (r) = g(r) ^ih £/(r) j 

which is equivalent to the operator equality (23.18) since g( r) is arbi¬ 
trary. By repeated application of (23.18) it is easily shown that 

f(r)pl - plf( r) = ih ( Px ^- + d l = 2 ih d £p* + W § (23.19) 

'This corresponds to the classical relation (/(r),p*J = df(t)/dx [compare with 
Eq. (23.9)]. 
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Velocity and Acceleration of a Charged Particle. The Hamiltonian 
(23.14) may now be written, with the help of (23.18), 

n2 - " 2 


H = £-2^c (p ‘ A + A ' P)+ 2i 

ieh 


A 2 + e 4> 


= X— -— A • p + 4—~ div A + 5—5 

2m me v 2mc 2 me 2 


A 2 + e<t> (23.20) 


The time derivative of a component of r is then easily shown from (23.2) 
to be 

dx 


dt 


= — (px - - 

m \ c / 


(23.21) 


in agreement with the classical relation between the velocity and momen¬ 
tum of a particle in the presence of an electromagnetic field. 

The calculation of a component of the acceleration of the particle 


d 2 x 
dt 2 


1 

m 


dpx 

dt 


X 


6 dAg 

C dt 

e dA x 


ihm mc dt 


■ [A X ,H] 


ihmc L " 

is straightforward, but rather tedious. The result may be written 

e 


d 2 x _ _ e /l dAx , d<A 
dt 2 m \c dt dx) 


+ 


~ %) (”’ “ . A ’)l ~ [(’’■“ « A ’) Car 

+ (23 ' 22) 

The Lorentz Force. Equation (23.22), with the similar y - and 
2 -component equations, can be written as a single vector equation for the 
“force” 

-'(“if _g " d *) 

l [i(p - l A ) X i™ 1 A > - !c “' A > X i (p - ; A ) 

;(I XH - HX $) 


m 


d^r 

A 2 


+ i 
= eE + ^ 


(23.23) 


where use has been made of Eqs. (23.15) and (23.21). Equation (23.23) 
is in agreement with the corresponding classical expi ession 


eE + ;(vXH) 
c 
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where v = dx/dt is the velocity of the particle, if we take a symmetric 
average of the two terms v X H and — H X v ; these are identical clas¬ 
sically but differ in quantum mechanics since the v given by (23.21) does 
not commute with H. 

Equation (23.23) includes a generalization of Ehrenfest’s theorem, 
which was discussed in Sec. 7. If we consider a diagonal element, the 
left side is the product of the mass and the second time derivative of the 
expectation value of the position vector of the particle. The right side 
is the expectation value of the Lorentz force acting on the charge of the 
particle. Thus (23.23) states that a wave packet moves like a classical 
particle if it is sufficiently well localized so that the electromagnetic fields 
change by a negligible amount over its dimensions. This result can, of 
course, also be obtained by the method of Sec. 7 when, in accordance 
with (23.1) and (23.20), the Schrodinger wave equation is taken to be 

- (- S'”+S- 41 p * 4 -+ 4 + as? ■ 4! + ■'*}* < 23 - 24 > 

Constants of the Motion. Equation (23.2) tells us that if F does not 
depend explicitly on the time (so that dF/dt = 0), then dF/dt = 0 if F 
commutes with H. In this case, F is said to be a constant of the motion. 
This is usually possible at all times only if H is also constant. If we sub¬ 
stitute H for F in (23.2), Ave see that the constancy of H implies that 
dH/dt = 0, or that H does not involve the time explicitly. Thus if H 
is independent of t f a function F of the dynamical variables of the system 
is a constant if it does not depend on t and commutes with H . 

An example of a constant of the motion is any one of the coordinates or 
momenta of the system whose canonically conjugate momentum or coordi¬ 
nate does not appear explicitly in H. Since the dynamical variable in 
question commutes with all other variables except its own canonical con¬ 
jugate, then it commutes with H. Thus if the Hamiltonian for a number 
of interacting particles does not depend on the position coordinates of 
the center of mass of the system, the total momentum of the system is a 
constant of the motion. This is in agreement with the classical result 
that the total linear momentum of a system of interacting particles is 
constant if there are no external forces. 

In similar fashion, we can make use of the third of Eqs. (14.20) to 
find the condition for constancy of the angular momentum of a particle. 
This equation states that the operator that represents the z component of 
angular momentum is M z = ih(d/dcj >), where 4> is the angular coordinate 
of rotation about the z axis. Thus, as with (23.11), Ave have the operator 
relation 


4>M S - M,4> = ih 


(23.25) 
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and <t> and M z may be regarded as canonically conjugate variables. If 
then H does not depend on the angular coordinates of a particle [for 
example, if V in Eq. (22.2) depends only on the radial distance r from a 
fixed center], M, is a constant of the motion; since there is nothing special 
about the choice of the z axis, M x and M y are also constants. This is 
in agreement with the classical result that the angular momentum of a 
particle moving in a central field of force is constant. 

The Parity Operator. Parity was first discussed in Sec. 9, and defined 
there and in Sec. 14 as the property of an energy eigenfunction that speci¬ 
fies whether it is even or odd with respect to changes of sign of all of the 
space coordinates. The parity operator P can be introduced in quantum 
mechanics, even though it has no classical analogue; it is defined as an 
operator that reflects all coordinates of all particles through the origin: 

Pf(xi,y h zi,x h y2,z 2 , ...,<) 

= /(——2/1, —^1, —o.: 2 , —2/2,—S2, ■■■,£) (23.26) 

It is evident from (23.26) that P 2 is the unit operator 1. Thus if P is 
diagonalized the square of each of its diagonal elements is unity, and its 
eigenvalues are ±1. 

If now II is left unchanged by reflection of all coordinates through 
the origin, P commutes with II and is a constant of the motion. Moreover, 
it follows from Sec. 21 that P and H can be made diagonal at once. Then 
the parity of an energy eigenfunction can be well defined (even or odd) 
and is constant in time. 

Energy Representation. It was shown in Sec. 22 that the set of 
Schrodinger energy eigenfunctions u k (x) may be thought of as a unitary 
ma trix that transforms the Hamiltonian operator from the r representa¬ 
tion into the diagonal form 


H kl = E k 8 kl or E k 8(h - l ) 


Although the discussion of Sec. 22 is valid for only one instant of time, it 
can be made valid for all time by using the time-dependent eigenfunctions 

__ iEkl 

u k (r)e h for the transformation, provided that II is a constant. The 
matrix representation in which II is diagonal is called the energy 
representation. 

The equation of motion (23.2) takes a particularly simple form in this 
representation, if F does not depend explicitly on the time: 

= A (EH - IIE)u = | (E>- E{)Fid 
at in n 


(23.27) 
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Equation (23.27) can be integrated to give 

i(Ek-Ei)t 

Fu(t) - Fy * (23.28) 

where Fh is the value of the matrix element at t = 0. Thus in the energj^ 
representation, the off-diagonal matrix elements of any time-independent 
function of the dynamical variables oscillate in time with frequencies that 
are related to the energy differences between the stationary states by 
Bohr’s frequency condition (see Sec. 2). 

24. ANGULAR MOMENTUM 

As an interesting and useful example of the direct treatment of 
dynamical variables by matrix methods, we now consider the properties of 
the angular-momentum operator. We work entirely with a representa¬ 
tion at a particular instant of time, so that we are not concerned here 
with the change in time of the resulting angular-momentum matrices. 
However, if the angular momentum commutes with the Hamiltonian, it is 
a constant of the motion, and the matrices retain their form for all time. 
It was shown in Sec. 23 that this is the case if the Hamiltonian is spheri¬ 
cally symmetric. 

Definition of Angular Momentum. We define the angular momentum 
M of a particle about some point in terms of its displacement r from that 
point and its momentum p as in Eq. (14.19) 

M = r X p (24.1) 

We do not now require that p be expressible as a differential operator, but 
rather that the components of r and p satisfy the commutation relations 
(23.16). It is then possible to find commutation relations between the 
components of M that do not involve r and p. 

[M z ,M y ] = (yp z - zp y ) (zp x - xp z ) - ( zp x - xp z ) (yp z - zp y ) 

= ypx(p*z — zp z ) + xp y (zp z — p z z) = ih(xp v — yp x ) 

We thus obtain 

[M m ,M v ] = ihM„ [M y ,M z ] = ihM x , [M S ,M X \ = ihM y (24.2) 

Equations (24.2) are seen to apply also to the total angular momentum 
of a system of particles, since the r and p operators for the individual 
particles commute with each other and so their angular momenta do also. 
Now it turns out that the relations (24.2) admit of some matrix repre¬ 
sentations for M that are not compatible with the original definition 
(24.1). When such a conflict arises, there is in some cases physical reason 
to regard (24.2) as being more fundamental than (24.1) (see the end of 
this section). 
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We note that the original definition (24.1) makes M Hermitian, since 
r and p are Hermitian. This is assumed to be true in general, since the 
components of M can be Hermitian without violating the commutation 
relations (24.2). 

Choice of a Representation. It is apparent from (24.2) that no two 
of the three components of M commute with each other, and so it is 
impossible to find a representation that diagonalizes more than one of 
them. However, all three components commute with 

M 2 = Ml + Ml + Ml 

for example, 

[M„ M 2 ] = M Z M\ - M\M Z + M Z M\ - M 2 M Z 

= ih(M x M y + M y M x ) - ih(M y M x + M x M y ) = 0 

Thus it is possible to diagonalize one component of M, say M z , and M 2 
at the same time; we use this to define our representation. 

It is then convenient to work with M z and the non-Hermitian matrix 

L = M x + iM y (24.3) 

from which it follows that 

M 2 = Ml + i(LL* + L*L) (24.4) 

The commutation relations involving L are found from (24.2) to be 

[MV„] = 0, [MM = hL , [L,L*] = 2 hM z (24.5) 

Our object is to find a representation in which M z and M 2 are diagonal. 
The rows and columns of this representation can be labeled by the eigen¬ 
values of these two dynamical variables, and we wish to determine these 
eigenvalues along with the corresponding matrix for L. The matrices 
for M x and M v are then given by solving (24.3) and its Hermitian adjoint 
equation: 

M x = *(/,* + L), M y = H(L* - L) (24.6) 

Relations between Matrix Elements. We label the rows and columns 
of our representation with a pair of symbols m and j. The eigenvalues of 
M z are mh, so that m is a dimensionless real number; j is related to the 
eigenvalues of M 2 in a way that is specified below [see Eq. (24.13)]. Then 
the first of Eqs. (24.5), written in matrix form, is 

(M 2 jL) m y, w ''j" (LM-) to ',-' iW ",-" = 0 
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If we work out the matrix products and remember that M 2 is diagonal, 
this becomes 

[(M 2 )/' - (M 2 )r]L mY> wY' = 0 (24.7) 

where (M 2 ),-' is the eigenvalue of M 2 that corresponds to j'. Equation 
(24.7) tells us that all matrix elements of L vanish except those for which 
j' = j n . Thus we can temporarily ignore j in specifying the L matrix, 
and remember when we finish that the matrix with which we come out is 
all for a particular value of j. This may be part of a larger matrix in 
which j can have various values, but there are no nonvanishing off- 
diagonal elements of L between the sections of the matrix that correspond 
to different values of j. 

It is therefore sufficient for the present to use only the symbol m to 
designate rows and columns of L. The second of Eqs. (24.5) then gives 

(M-zL^m' ,m" (Z/iW*) wl ', w " — 

or, since M z is diagonal with eigenvalues mh , 

(m' - 'in")hL rn ' im » = Mw.m" (24.8) 

Thus the only nonvanishing matrix elements of L are those for which 
m' = m" + 1, and we denote these by 

Tm-fi, m = A Ji (24.9) 


so that X m is a dimensionless number that may be complex. 

If now we take the mth diagonal element of the third of Eqs. (24.5), we 
obtain 



— = 2 Wm 


Each sum is seen to contain only one nonvanishing term, so that this 
equation becomes, with the help of (24.9), 


\\ m -i\ 2 — |X W | 2 = 2m 


(24.10) 


Eigenvalues of M z . Equation (24.10) is a first-order linear difference 
equation in |\ m | 2 , and its general solution has one arbitrary constant 


|X W | 2 = C — m(m + 1) (24.11) 

Now |X m | 2 is necessarily positive or zero, and yet the right side of (24.11) 
evidently attains negative values for sufficiently large positive and nega¬ 
tive values of m. This does not cause difficulty if there are t.wo values 
mi and m 2 of m for which \ m = 0, and if these two values differ from each 
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other by an integer. If this is the case, the series of th values in which 
successive terms differ by unity can terminate at both ends without 
|X W | 2 becoming negative. Equation (24.8) can be satisfied at the upper 
end (m = mi) by having L mi+1 , mi = 0 rather than by having an Ilf, eigen¬ 
value greater than m h and (24.8) can be satisfied at the lower end 
(m = m 2 ) by having L m . i+1 , ma = 0 rather than by having an M z eigen¬ 
value less than m 2 + 1. |X m | 2 is evidently nonnegative for m values that 

range from m 2 + 1 to mi, inclusive. 

We thus have a finite series of eigenvalues of M z ranging from m x 
down to m 2 + 1 by unit steps, where m x and m 2 are the larger and smaller 
roots of the quadratic equation C — m(m+l)=0: 

mi = —-f“ i(l + 4(7) 3 7 m 2 = ^ ir(l 4(7)* 

We rename mi and call it j, in which case C = j(j + 1), and the eigen¬ 
values of M s range from j to —j by unit steps. This implies that 2 j is a 
positive integer or zero, or that j is restricted to the series of values 0, 

113 

A >*i • • • • ' _ 

Eigenvalues of M 2 ; the L Matrix. Equation (24.11) can now be 
rewritten in terms of j 

|\„»| 2 = j(j + 1) — m(m + 1) = (j — m) (j + m + 1) (24.12) 

The eigenvalues of M 2 can be found by calculating a diagonal element of 
(24.4): 

= {m 2 + UiU + 1) ~ 1 >n+j(j + 1) - m(m + 1 )]}A a 

= j(j + 1)A* ( 24 * 13 ) 

We thus have an infinite number of representations for the matrices 
M 2 , M Z) and L, each of which is characterized by a zero or half-integer 
value for j and has 2 j + 1 rows and columns. As expected, the eigen¬ 
values of M 2 are all the same for a particular value of j. All of these 
representations may be taken together to form a single representation of 
infinite rank, although it is often more convenient to consider them 
separately. 

Equation (24.12) leaves the phase of the matrix elements of L arbi¬ 
trary. This corresponds to an arbitrariness in the choice of the phase of 
the angular-momentum eigenfunctions when they are normalized, and 
is not of physical significance. We therefore choose all the phases to be 
zero, and obtain for the nonvanishing matrix elements of L 

L (w+ = [(j — m) {j + m + 1 )]*h (24.14) 

For j = 0, M 2 and the components of M are all represented by null 
matrices of unit rank: (0). The matrices for the next three values of j, 
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as obtained from Eqs. (24.6), (24.13), and (24.14), are 


i = i 


j = 1 


J = 




M, 


o) 

= * A (o -?) 


; " w (? o) 

?) 


M. = 


m z = ih (; 


(24.15) 



Connection with the Spherical Harmonics. Comparison of the 
foregoing results with those of Sec. 14 suggests a close connection between 
t e matrix representations of the angular momentum for which j = l 
is an integer, and the spherical harmonics Yi m (d,4>) defined in (14 6) A 
comparison of Eqs. (14.22) and (14.23) with the M 2 and M z matrices 
shows that the corresponding angular-momentum operators of Sec. 14 
are .simply another representation of the matrices considered in this 
section. Ihis can also be shown for the L matrix by computing the 
effect of operation with L on a spherical harmonic. From (14.20) and 
(24.3), the A operator in polar coordinates is seen to be 


L = hei *(je + icot9 Ji) (24.16) 

It is then possible, by making use of the properties of the spherical har- . 
monies discussed in Sec. 14, to show that 

LYi m (6,4>) = ±[(l - m )(l + m + l)]» flYi >m+1 (0,*) (24.17) 
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where the minus sign is for m 0 and the plus sign is for m < 0 (the 
calculation has a somewhat different structure for the two groups of m 
values). Thus if we set up a matrix to represent the operator L in 
analogy with Eq. (22.5), we get just the matrix (24.14) with integer j, 
except for change of some of the signs, which are arbitrary anyhow. 

In similar fashion, we can show in analogy with Eqs. (22.7) and (22.8) 
that Yim (0,</>) can be regarded as a unitary matrix that transforms from a 
representation in which the rows and columns are labeled by the angle 
variables 6 ,</> to one in which they are labeled by the quantum numbers 
Z,m. The first representation is that which is implicit in the work of 
Sec. 14, and in which L would be represented as the result of operating 
with (24.16) on a suitably normalized 8 function of the angles. The 
second representation is that developed in this section, in which L is 
given by (24.14) except for some sign changes. 

Spin Angular Momentum. It follows from the work of Sec. 14 and of 
this section that if all of the integer j representations are taken together to 
form a single representation of infinite rank, the M matrices can be 
expressed by (24.1) in terms of r and p matrices that satisfy the commuta¬ 
tion relations (23.16). This is not true of the half-odd-integer j matrices, 
which are solutions of Eqs. (24.2) but not of the more restrictive Eqs. 

(24.1) and (23.16). Thus the matrices for the components of the orbital 
angular momentum of a particle or a system of particles must have eigen¬ 
values that are integer multiples of h , since they must be expressible in 
terms of coordinates and momenta. 

There is nothing to prevent a particle from having an intrinsic angular 
momentum that is described by Eqs. (24.2) but cannot be expressed in 
terms of the position and momentum of the particle in accordance with 

(24.1) . The eigenvalues of the components of such an angular momen¬ 
tum could be integers or half odd integers. Moreover, M 2 could have a 
single eigenvalue corresponding to a single value of j , since M 2 commutes 
with all three components of M and there is no reason why it should not 
also commute with r and p [since Eq. (24.1) is not valid in this case]. 
Thus M 2 could commute with all the dynamical variables that describe 
the particle, and so could be a constant of the motion under all circum¬ 
stances; there would then be no objection to equating M 2 to a definite 
number j (j + 1 )N l . This is not possible for an orbital angular momen¬ 
tum, since in that case M 2 docs not commute with r and p, and hence is 
not always a constant. 

An intrinsic angular momentum of the type described in the last 
paragraph is called a spin angular momentum. It is found experimentally 
that electrons, protons, and neutrons each possess a spin angular momen¬ 
tum for which j = % and the M matrices are given in (24.15). There is 
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also some reason to believe that particles, called mesons or mesotrons, 
exist that have masses intermediate between the electron and proton 
masses and spin angular momenta for which j is 0 ; or 1. 

Addition of Angular Momenta. It is sometimes of interest to con¬ 
sider the vector sum M = Mi + M 2 of two angular momenta Mi and M 2 
that commute with each other [all components of Mi commute with all 
components of M 2 , and Mi and M 2 separately satisfy the commutation 
relations (24.2)]. These angular momenta could refer to independent 
particles, or to the noninteracting spin and orbital angular momenta of 
the same particle. A representation in which Mf, M|, Mu, and M 2z are 
diagonal is easily obtained from the foregoing theory. The rows and 
columns are labeled byji, j 2 , mi, and m 2 ; the matrices for Mi, for example, 
have the form (24.15) so far as the indices j i and mi are concerned, and 
are unit matrices as regards the indices j 2 and m 2 . 

It is possible to find a second representation in which M?, M|, M 2 , 
and M z are diagonal, for which the rows and columns are labeled by 
j i, j 2 , j, and m [j(j + 1 )h 2 is an eigenvalue of M 2 and mh is an eigenvalue 
of M z ], If j i and j 2 are fixed, the first representation has 

(2ji + l)(2j2 + 1) 

rows or columns, and can be specified by products of eigenfunctions of 
Mu and of M 2z , just as the representation (22.6), in which the Hamil¬ 
tonian is diagonal, is specified by eigenfunctions u k of the operator H. 
The second representation, with the same fixed values of ji and j 2 , must 
have the same number of rows, since it is specified by eigenfunctions of 
M 2 and M z that are linear combinations of the original eigenfunctions. 
We now find the values of j and m that appear in the second representation. 

Eigenvalues of (Mi + M 2 ) 2 . Since M z = M iz + M 2z , it is apparent 
that the possible values of m are mi + m 2 . The largest value of m is 
therefore ji + j 2 , and this occurs only once, when m x = ji and m 2 = j%. 
This shows that the largest value of j is ji + j 2 , and that there is only one 
such state. The next largest value of m is j i + j 2 — 1, and this occurs 
twice: when mi = j x and m 2 = j 2 — 1, and when m 1 = ji — 1 and 
m 2 = j 2 (provided that neither j x nor j 2 is zero). One of the two linearly 
independent combinations of these two states must be associated with 
the new state for which j = ji + j 2 , since for that j value there must be m 
values ranging from j i + j 2 to —ji — j 2 by integer steps. The other 
combination cannot be associated with this or a larger j, since the larger 
m values that should then also be present actually are not. Therefore 
the second combination is associated with j = j 1 + j 2 — 1. By an 
extension of this argument we can see that each j value, from ji + j 2 
down to |ji — j 2 \ by integer steps, appears just once, and that with each 
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is associated 2 j + 1 linearly independent combinations of the original 
eigenfunctions. Thus the number of rows or columns of the second 
representation is 

31+32 

^ (2 j + 1) = (2ji + 1)(2j 2 + 1) 

i = tii —J 2 I 

as expected. 

The foregoing result is the same as the addition rule for angular 
momenta of the old quantum theory: the magnitude of the sum of two 
angular-momentum vectors can vary from the sum of their magnitudes 
(parallel case) to the difference of their magnitudes (antiparallel case) by 
integer steps. 

The unitary matrix that transforms from the mi,m 2 to the j,m repre¬ 
sentation for fixed values of ji and j 2 can be found by matrix methods. 
Since its structure is rather complicated it is not quoted here. 1 

Problems 

1. Assume that any Hermitian matrix can be diagonalized by a unitary matrix. 
From this, show that the necessary and sufficient condition that two Hermitian 
matrices can be diagonalized by the same unitary transformation is that they 
commute. 

2. Given two matrices A and B that satisfy the following equations: 

A* - O, A A* + A*A — 1, B = A*A 

where O is the null matrix and 1 is the unit matrix. Show that B 2 = B. Find A and 

B in a representation in which B is diagonal, assuming that it is non degen crate. Can 
A be diagonalized in any representation? 

3. Given three matrices A, B, and C that satisfy the following equations: 

A 2 = B 2 = C 2 =1, AB + BA = BC -f CB = CA + AC - O 

where 1 is the unit, matrix and O is the null matrix. Find all three matrices in a 

representation in which A is diagonal, assuming that it is nondegenerate. 

4. Given three matrices A, B , and C that satisfy the following equations: 

A 2 = B 2 - C 2 = 1, BC - CB = iA 

where 1 is the unit matrix. Show that A B +■ BA = AC 4- CA — O, where O is the 
null matrix. Find all three matrices in a representation in which A is diagonal, 
assuming that it is nondegenerate. 

6. Make use of the matrix expression (13.18) for x in a representation defined by 
the harmonic-oscillator wave functions to obtain a similar matrix for x 2 , using purely 
matrix methods. Compare with the answer to Prob. 3, Chap. IV. 

6. Use purely matrix methods to show that if x nm 0 for a harmonic oscillator, 
then E n — F m = ±fi(K/tx)K Note that fora harmonic, oscillator, II = p 2 / 2 M + \Kx 2 , 
xp — px — ifi. 

1 See E. XI. Condon and G. II. Shortley, “The Theory of Atomic Spectra,” Chap. 
Ill, See. 14 (Macmillan, New York, 1935); Chap. TII also discusses other interesting 
properties of angular momentum. 
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7. Show by purely matrix methods that if H - p 2 /2ju -f V(t ), 

h ’ 2 


i 


(E n — Em)\Xnm \ 2 = 




where the summation is over all states and a; is a cartesian component of r 

8. If H = p 2 /2/i + V(x) for a one-dimensional particle, and V(x) car 
as a power series in x, show by purely matrix mathods that 


dx __ p 

dt jj? 


dp = _ dV 
dt dx 


What is meant by the operator d/dt in this connection? 

9. Transform the Hamiltonian for the harmonic oscillator expres 
representation 

tt _ ft 2 d 2 , 

H= -2,M + * Kx * 

into the p representation. What are the solutions in the p represen tatio 
spond to the x representation wave functions (13.13)? 

10. A(x,p) and B(x,p) can be expressed as power series in x and p, an 
Show that 

K™k lA ’ B] = 

11. Verify Eq. (24.17) by direct operation on the spherical harmonic 
operator given in Eq. (24.16). 



CHAPTER VII 

APPROXIMATION METHODS FOR STATIONARY PROBLEMS 


In quantum mechanics, as in classical mechanics, there are relatively 
few systems of physical interest for which the equations of motion, are 
capable of exact solution. Approximation methods are expected to play 
an important part in virtually all applications of the theory. This 
enhances rather than diminishes the importance of those problems for 
which exact solutions can be found, since as was pointed out at the begin¬ 
ning of Chaps. IV and V, exact solutions are often useful as starting points 
for approximate calculations. In addition, they may also help to estab¬ 
lish limits of validity for various approximation methods. 

In this chapter and the next we develop several approximation 
methods and apply them to illustrative problems. It is convenient to 
divide the methods into two groups according as they deal with stationary 
states of systems that, arc represented by eigenfunctions of the energy 
(considered in this chapter), or with problems in which the Hamiltonian 
depends on the time (considered in Chap. VIII). In both cases we start 
with the Schrodinger wave equation, and only occasionally introduce 
matrix methods or notation. 

26. STATIONARY PERTURBATION THEORY 

The stationary perturbation theory 1 is concerned with finding the 
changes in the discrete energy levels and eigenfunctions of a system when 
a small disturbance is applied. It is assumed from the outset that the 
Hamiltonian // in the Schrodinger wave equation can be written as the 
sum of two parts. (>no of those parts Ho is of sufficiently simple structure 
so that its Schrodinger equation can be solved, and the other part H' is 
small enough so that it can be regarded as a perturbation on H o. It is 
convenient to retain our old symbols u n and E n for the supposedly known 
normalized eigenfunctions and eigenvalues of the unperturbed Hamilto¬ 
nian H o, and use \p and W for the perturbed stationary wave function and 
energy level: 

Tty = Wf, H = Ho + H', Hoiin. = EnUn (25.1) 

Nondegenerate Case. The assumption that II' is small suggests that 
we expand the perturbed eigenfunction and eigenvalue as power series in 

1 E. Schrodinger, Ann. d. Phydk, 80, 437 (1926). 
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H' This is most conveniently accomplished in terms of a parai 
such that the zero, first, etc, powers of X correspond to the zei 
etc. orders of the perturbation calculation. We replace H by X 
express and W as power series in X. We assume that these tw 
are analytic for X between zero and one, although this has not bee 
tigated except for a few simple problems. 1 The different order, 
perturbation approximation are then given by the coefficients 
responding powers of X. In the final results, X is set equal to 1. 

The perturbed wave function and energy level are written 

\f/ = + Xi/'i + XV2 X 3 !/^ + • 

w = Wo + \Wi + xw 2 + x s w 3 + • • • 

and are substituted into the wave equation to give 

(Ho + Xff')(* 0 + X*i + • • •) 

= (Wo + XFi + • • OW'D + Xv^i + • • •) 

Since Eq. (25.3) is supposed to be valid for a continuous range 0 
can equate the coefficients of equal powers of X on both sides to 
a series of equations that represent successively higher orders 
perturbation. 

Hq\Pq = Wqnpo 

H 0 \pi + = ^ 0^1 + Wifio 

#0^2 + H't 1 - W*P 2 + Witi + Wrfa, etc. 

The first of Eqs. (25.4) means that i/'o is any one of the unper 
eigenfunctions, as expected. We therefore put 

= M», Wo = 

This state u m is assumed to be nondegencratc, although others 
unperturbed eigenfunctions may be degenerate. The ease in whi 
unperturbed state \po is degenerate is considered later in this s< 
First-order Perturbation. It is implicit in the present, treatmer 
the unperturbed state u m is one of a discrete set. (even though the 
set of u' s may be partly continuous), since otherwise no interest 
attach to the calculation of the perturbed energy. In the next s 
we consider the perturbation of one of a continuous set of eigenfunc 
in connection with collision problems. 

We expand in terms of the u n 

'Pi = Sttn’W-n 

1 For a discussion of this point, see N. Arley and V. Borclisonius, Acta Ma 
261 (1945), especially Part IV. 
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where S denotes a summation over the discrete set together with an 
integration over the continuous set of eigenfunctions. Substitution of 
(25.6) into the second of Eqs. (25.4) gives 

S afi'HoUn + H'u m = EmS a n )u n + W lU m 

We replace H^u n by E n ii n in the first term, multiply by ilk, and integrate 
over all space, making use of the orthonormality of the w’s: 1 

a£\E m — Ei ;.) + IE idk, n = \ iii c H r u m dr = H'km (25.7) 

where the integral on the right is the km matrix element of the perturba¬ 
tion energy H' in the representation in which the unperturbed Hamilto¬ 
nian H o is diagonal [see Eq. (22.5)]. 

If we take k = m in Eq. (25.7), we see that 

Wi = H: m (25.8) 

which is the expectation value of H' for the state m. Equation (25.7) 
also gives 

aT = * * rn (25.9) 

We thus have a solution to first order in H f , except that a ( % is still unde¬ 
termined; it is obtained below from the normalization of 

Second-order Perturbation. The solutions to second order in H' are 
found from the third of Eqs. (25.4) by substituting in 

= S <"■» (25.10) 

to give 

S + Il'Sa^Un = E„S<>'n + WSa%n* + Wiu m 


As before, we replace //««„ by K n u n in the first term, multiply by u h and 
integrate over all space to obtain 

a<l>(K. - A'*) = S - F,«T - W,S km (25.11) 


If now we set k = m, we see with the help of (25.8) that 

ii r __ _ C' ^ \jjj rnn j_jj 

rK •> — O n 11 mn ~~ O V " ]/ O // __ W 

yj m — Jh n rj m ~ rj n 


(25.12) 


where the prime on S denotes t he omission of the term n = m from the 
summation and integration over n. Similarly, if k 9^ m in Eq. (25.11), 

1 jukii-ndr is equal to fan if either k or u is one of a. discrete sot, and is equal to 
5(/c — n) if both belong to a continuous wet.; in either case, ^njnj windr — /* (see Sec. 
10 ). 
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we obtain 

TT / TT ( 

12 km n mm 

{Em - E k y 

+ k 7 * m (25.13) 

IL m — jLk 

We thus have a solution to second order in H', except that a ( %, like a ( i 5 , 
is not yet determined. 

Normalization of Since yp 0 is chosen to be equal to u m , \j/ is already 
normalized to zero order. If then the normalization integral J ]\p\ 2 dr is 
set equal to unity for all orders of X, when \p is given by (25.2), we 
obtain 

KMi + $o'l'i)d,T = 0 in first order 

j(\p 0^2 + ^ 0^2 + |^i | 2 )dr = 0 in second order 

These give at once 

+ K = 0, a«> + a® + SKI' 2 = 0 


7 ( 2 ) 


= S' 


HLH. 


kn n nm 


{E m - E k ){E m - E n ) 


The real parts of a™ and a™ are fixed by these relations, but their 
imaginary parts are not. The choice of the imaginary parts of these 
coefficients is equivalent to the choice of a new phase for ^ in each order 
of the calculation; this in turn affects the phase of the next higher order 
term in \p. There is no loss of generality involved in making the simple 
choice of zero for these imaginary parts, in which case 

K = o, a* = -iSlKI 2 


It should be noted that the perturbed energies are independent of this 
choice. 

The energy and wave function to second order in H' are then (setting 
X = 1) 


w = E m + H,; m + SI '* 


E n 


* Um + E m - E k + {_S» 


TT t TT / 
^ kn-EJ- nm 


{E„ - E k y\ 


(E m - E k ){E,„ - E n ) 

\HLY- 


(25.1-4) 


u k - f 


(E m - E t ) 


", U/T 


It is interesting to note that the calculation of If to a given order in H f 
requires knowledge of 1 (/ only to the next lower order. 

Perturbation of an Oscillator. As a simple example of the application 
of the first- and second-order perturbation theory to a nondegenerate 
state, we consider the perturbation of the rath energy level of the linear 
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harmonic oscillator of Sec. 13 by an additional energy H' = ^bx~. The 
unperturbed Hamiltonian is H 0 = p 2 /2 M + %Kx 2 (the mass is denoted by 
n to avoid confusion with the quantum number m); the unperturbed 
eigenfunctions u m (x) given by Eq. (13.13) correspond to the eigenvalues 
E m = ( m + %)K k /p)\ where m = 0, 1 , 2, ... . This example is 
evidently a trivial one since the perturbed eigenfunctions and eigenvalues 
are given simply by replacing K by K + b in u m (x) and E m -, it is neverthe¬ 
less instructive. 

We require the matrix elements of x" between various pairs of har¬ 
monic-oscillator wave functions. These may be obtained with the help 
of the generating function (13.10) for the Hermite polynomials, as in 
Prob. 3 of Chap. IV, or more simply by matrix multiplication using the 
definition (21.3) and the expressions for x nm given in Eq. (13.18). We 
readily obtain 

(2a 2 ) _l [(m + l)(wi + 2)]&, n = m -f 2 
(2a 2 ) -1 (2 m + 1), n = m 

(2 a 2 ) -, [m(m — l)]i, n = m — 2 (25.15) 

0, otherwise 


(* 2 )> 


-I 


nl^km 


where a — (fiK/h 2 )K Substitution into the first of Eqs. (25.14) then 
gives for the energy to second order 


W = (m + i-)h 



b 2 1 
8 K 2 


in agreement with the expansion of (m + v)h[(K + b)/ /x]^ to second order 
in b . 

Degenerate Case. We now show that the foregoing treatment is 
incomplete when the unperturbed state m is degenerate. We suppose 
first that there is a state k that is degenerate with and orthogonal to the 
state m (72* = E m , $a k u m <h = 0). Then Eq. (25.7) tells us that HJ m = 0. 
Thus the first-order perturbation theory developed above is satisfactory 
only if II } ! m actually is zero. 

If now II = 0 and we suppose in addition that HU = Hj m , Eq. 
(25.11) tells us that 

S ^jil = S = o 

n 5^ k,m n 

In this case the degeneracy between the states k and m is not removed in 
first order, since the first-order perturbed energies for the two states 
( E k + HU and E m + HJJ are equal. Then the second-order perturba¬ 
tion theory developed above is satisfactory in general only if there exist 
no states n that connect the states h and m through the perturbation, 
i.e ., only if cither or both of H k ' n and Hj m are zero for all n. 
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We can summarize the situation by saying that the foregoing pertur¬ 
bation theory fails in first order if there is degeneracy of the unperturbed 
state in zero order and the perturbation energy H' connects the degenerate 
states in first order. Similarly, the theory fails in second order if there is 
degeneracy of the unperturbed state in first order and the perturbation 
energy connects the degenerate states in second order (through one or 
more other states). 

Removal of Degeneracy in First Order. Let us suppose that the 
perturbation removes the degeneracy of the unperturbed state m in some 
order. This means that there are at least two exact eigenfunctions of the 
Hamiltonian H = H 0 + X H f that have eigenvalues that are different 
when X 7 ^ 0 and become equal when X is made to vanish. Now we 
assumed earlier that the eigenfunctions are continuous analytic functions 
of X as X —»0; thus each of the eigenfunctions that are nondegenerate 
when X ^0, approaches a definite linear combination of the degenerate 
unperturbed eigenfunctions when X = 0. If these linear combinations 
are not the same as the unperturbed eigenfunctions on which the calcula¬ 
tion is based, the expansions (25.2) are not valid for X = 0, and the 
method developed above breaks down. 

It now becomes clear that we may treat degenerate unperturbed 
states by perturbation theory, if we first perform an exact diagonalization 
of as much of the perturbation matrix H' nl as is necessary to remove the 
degeneracy. This is equivalent to finding the linear combinations of the 
unperturbed eigenfunctions that fit on continuously to the exact per¬ 
turbed eigenfunctions when X is made different from zero. For example, 
suppose that only the unperturbed state k is degenerate with m, and 
H km = H' mk 0. In this case we need only diagonalize the submatrix 

(25.16) 

in order to remove the degeneracy (in this case it is removed in first 
order) and find the correct linear combination of u m and u k that can be 
used for higher order perturbation calculations. An explicit example of 
this type of situation is given below. 

Removal of Degeneracy in Second Order. It may, however, happen 
that H' km = 0 and H' kk = H' mmi so that the degeneracy is not removed in 
first order. A direct but unnecessarily complicated procedure then con¬ 
sists in diagonalizing the submatrix of the entire Hamiltonian that 
includes the rows and columns labeled by all subscripts n for which either 
H' mn or H kn is different from zero. Rows and columns of this matrix can 
be rearranged to bring together any ones that are desired. For example, 
when there are two such subscripts n and 2 , we can diagonalize 


(H' mm H' m A 
\H' km H'J 
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£L 4- H f 

0 

TJt 

11 tan 

H' m i 

0 

E m + H' mm 

HI 

H'u 

ZT/ 

11 nm 

H' nk 

E n + H' nn 

H' nl 

i HL 

H'ik 

Hi 

Ei + H\ 


(25.17) 


to remove the degeneracy in second order. 

A less direct but analytically simpler procedure consists in expanding 
the exact eigenfunctions in powers of X as in (25.2), (25.6), and (25.10). 1 
Because of the degeneracy, however, we now include both u m and u k in 
the zero-order term. 


1pm — Q'inM'm “f“ (Lk'U'k 4" $ (Xttj^ 4" X^&P^)Wj 
ipk = b m u m + bkUk + S^X^i 15 + \‘ 2 b\ 2) )ui 
ipn = U n + S(X<V + \' 2 a$)ui, n m,h 

ly^Tl 

where the prime on S indicates that l ^ m,k. Substitution of the first of 
these into the wave equation (Ph + iP)^» — W m ip m , where 


W m = E m + XTTff + \*W%\ 
gives to second order in X 

\a m H r u m + \a k H f Uk + $ '('ka^Em + \ 2 a\ 2) Em + WapH'ui) 

= (\W% + \ 2 W%)(a m u m + a k u h ) 

+ S 'frapEmUt + VoFEm + XWWWui) (25.18) 

We now multiply (25.18) through on the left by u m and integrate to 
obtain 

\aJV mm + S = XWa, + \ 2 W%a m (25.19) 

since JI' mlc = 0. Similarly, liq. (25.18) can be multiplied by u k and by 
u n (n m,/r), and integrated in each case to give 

\a,M' kk + S = \W$aic + XW«>a* (25.20) 

\a m H' nm + \ a JI' nk + \a™E n + \*a™E n + S^M l) H' nl 


= X a^E m + X 2 a( 2) E,„ + (25.21) 

The first-order terms in (25.19) and (25.20) give the expected result 
W a) = IV = 

r ' 7/1 u * mm 1 * kk 


1 J. H. Van Vlucik, Rhys. Rev., 33, 407 (1020), Son. 4. 
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S'aPH' ml = W%>a m 
SW* = W$a k 


[Chap. VII 


(25.22) 


The first-order terms in (25.21) give an expression for aP, where l = n ^ 
m,k: 

a? ) (E m — Ei) = a m H\ n + a k H[ k (25.23) 

Substitution of (25.23) into (25.22) gives a pair of homogeneous algebraic 
equations for cu and a*. These equations have a nonvanishing solution 
and only if the determinant of the coefficients of a m and a k is zero [see 
the discussion of Eq. (21.19)]: 


C/ HmlHlm _ t™ 2) 

° EL- Ei Wm 

C/ HuHL 

° Em - E t 




S' 


^ Em- E t 


l kl[± Ik 

E m — Ei 


- w% 


= 0 


(25.24) 


The secular equation (25.24) is of second order in this case, while the 
similar equation resulting from (25.17) is of fourth order. It is apparent 

diagon^izlti^n Ufe ****** l0W6r ° rder secular Rations than exact 

Equation (25.24) would also have been obtained if we had started 

ur ?2> the /w a S° n fC T t k rather than The two roots of (25.24) are 
hh a w Z' ’ T d the tW ° pairs of solutions of (25.22) are a m ,a k and 
6,„,6 a We thus obtain perturbed energy levels in which the degeneracy 

has been removed in second order, and also find the correct linear combi- 
ations of the unperturbed degenerate wave functions u m and u k 

First-order Stark Effect in Hydrogen A? py, mn i 

a n. , , , . ^ As an example, we now con¬ 

sider the first-order change in the energy levels of a hydrogen atom due 

(Stark fffeT ° f Strength E directed along the positive z axis 

Li aSLS L ST Hamiltonian ' or a 

TJ T72 

~% r ‘“7 

Tl*? th ! Tf maas <16 ' 4) ' W “ the «*"* -"M-gy of the nucleus 
and electron due to the external field and is readily shown to be 


H’ = —eEz = -eEr cos 0 
where the polar axis is in the direction of positive z. 


(25.25) 


The discussion of Sec. 14 showed that the wave functions for any 
spherically symmetric potential energy, when expressed in spherical 
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harmonics, have even parity when the azimuthal quantum number l is 
even, and odd parity when l is odd. Now the perturbation (25.25) is 
an odd operator, since it changes sign when the coordinates are reflected 
through the origin. Thus the only matrix elements of H' that fail to 
vanish are those for unperturbed states that have opposite parities; in 
particular, all diagonal elements of H r for the unperturbed hydrogen- 
atom wave functions given in (16.24) are zero. This shows that a non¬ 
degenerate state, such as the ground state (n = 1) of hydrogen, has no 
first-order Stark effect. 

The first excited state (n = 2) of hydrogen is fourfold degenerate; the 
quantum numbers l and m have the values (0,0), (1,0), (1,1), (1,-1). 
We now show quite generally that nonvanishing off-diagonal matrix 
elements of H' exist only for states that have the same quantum number 
m. It is apparent from (23.16) that z commutes with the z component of 
angular momentum M z = xp y — yp x , so that [M Zj H] = 0. The ks 
matrix element of this commutator in a representation in which M s is 
diagonal is (m k — m a )hHl a = 0, so that H ka = 0 unless m k — m 8 . Thus 
only the first two of the above four degenerate unperturbed states need 
be considered in calculating the first-order Stark effect of the first excited 
state of hydrogen. 

Perturbed Energy Levels. The perturbation submatrix that has to be 
diagonalized is of the form (25.16), where H n [ m = H kk = 0, and 

H£ m = —e£ J tZ 2 io(r)?’ cos 9 u 2 oo(r) dr 

= — f [ r 4 (2 -e-^ow 2 dwdr 

16a 0 4 Jo J -1 \ do/ 

—— 

where w = cos 6, and use has been made of (16.24). We now wish to 
transform this two-row sub matrix of H' from the representation in terms 
of u 2 oo and u 2 10 to a representation in which it is diagonal and has the 
eigenvalues Wi and W 2 - 

We follow the notation of Eqs. (22.3) and (22.5). The wave func¬ 
tions are V\ = u% 00 , v 2 = u 2 10 for the nondiagonal representation, and 
SnVi + S 12 ^ 2 , S-nVi + S 22 V 2 for the diagonal representation. Then in 
accordance with the discussion of Eq. (21.19), the eigenvalues of W are 
the two roots of the determinantal or secular equation 

H[ x -Wi H[ 2 -Wi 3eEa 0 

= 0 , i = 1,2 

H{ 2 H 22 — Wi 3eE«o -Wi 
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These are easily found to be F, = 3eEa 0 , W a = -3e£a 0 . The trans¬ 
formation matrix S can be found by writing out the matrix equation 


SH' = WS, W = 


(W, 0 \ 

V o wj 


together with the unitary conditions on S. The result contains arbitrary 
phase factors, which may be chosen equal to zero, in which case 


S = 2 ~ i I 


U -1, 


Thus two of the four degenerate states for n = 2 are unaffected by 
the electric field to first order, and the other two form linear combinations 
1 HW 200 + m 2 io) with extra energy 3eEa 0 in the electric field, and 

2 “ * («200 — U210) 

with extra energy -3eEa„. This means that the hydrogen atom in this 
unperturbed state behaves as though it has a permanent electric-dipole 
moment of magnitude 3ea 0 , which can be oriented in three different ways- 
one state parallel to the external electric field, one state antiparallel to 
me held, and two states with zero component along the field 

Occurrence of Permanent Electric-dipole Moments, 'as remarked 
above, a permanent electric-dipole moment (energy change proportional 
to E) can appear m hydrogen only when the unperturbed state is degen¬ 
erate whereas an induced electric-dipole moment (energy change propor- 
lonal to E ) can appear m any state (see Probs. 1 and 12). We now show 
t at the first conclusion is generally valid for any system that is described 
by a Hamiltonian that is unaffected by reflection of the coordinates of all 
particles through the origin. It follows from the discussion of Eq (23 26) 
that a nondegenerate state of such a system has definite parity (even or 
). Then since the eleetnc-dipole-moment operator is odd, its expec- 
ation value is zero. All the interactions between particles thus far 
encountered in physical theory lead to Hamiltonian functions that are 
unchanged by reflection. Since the ground states of all atoms and nuclei 
are very likely to be nondegenerate, 1 it is to be expected that an atom or a 

the qvatem ^ r ° m < ^ e S en ® 1 | a °y ^ ue t° the orientation of the total angular momentum of 

thesedeZemW 6 ; ^ riS0 t0 an elect ™-dipole moment because all 

snecLl svmmetrv 7 * ^e same parity, degeneracy either is associated with some 

spemal symmetry property of the system (such as the separability of the hydrogen- 
atom wave equation m spherical and parabolic coordinates) that is unlikely to occur 
for many-partide systems, or is accidental and hence very improbable on statisticJ 
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nucleus in its ground state will not possess a permanent electric-dipole 
moment, and none has ever been found experimentally. An extension of 
the foregoing argument leads to the expectation that an atom or a nucleus 
can possess electric charge, electric-quadrupole moment, magnetic-dipole 
moment, etc., but not magnetic pole, electric-dipole moment, magnetic- 
quadrupole moment, etc. (see also Prob. 19, Chap. XI). 

26. THE BORN APPROXIMATION 

In the preceding section we considered the perturbation of those 
energy eigenvalues, and the corresponding eigenfunctions, which are 
part of a discrete set. Here we are concerned with the perturbation of 
one of a continuous set of eigenfunctions. As in Chap. V, such eigen¬ 
functions are of interest in connection with the theory of collisions. The 
object of the calculation is not to determine an energy eigenvalue, which 
in this case can be fixed in advance, but to find the perturbed eigen¬ 
function and its relation to the scattering cross section. To simplify 
matters, we restrict our attention to those cases in which the entire 
potential energy of interaction between the colliding particles is regarded 
as a perturbation, and carry the calculation only to first order. As we 
shall see, this Born approximation 1 is best applied when the kinetic 
energy of the colliding particles is large in comparison with the inter¬ 
action energy. It therefore supplements the method of partial waves 
(Sec. 19), which is most useful when the bombarding energy is small. 

Perturbation Approximation. We wish to solve the wave equation 
for the relative motion, Eq. (18.8), 

- %- V-u + F(r )u = Eu, ii = —(26.1) 

2 n w wi + mj 

and obtain an asymptotic form like that given by Eq. (18.10), 

hW 1 

w(r,0» — c*- + r~ l f(0,4>)e lhr , E = ^ (26.2) 

We adopt the perturbation approach of Sec. 25, and put 

u(r) = c ikz + ^(r), (26.3) 


grounds. In some molecules, however, there is a group of nearly degenerate states 
between which matrix elements of the eleetrie-dipole operator exist; if these energy 
levels are elosely spared in comparison with either the thermal energy of the molecule 
or the energy associated with the applied electric, field, they give rise to a permanent 
electric-dipole moment [see J. IT. Van Vleck, "The Theory of Electric and Magnetic 
Susceptibilities,” p. 154 (footnote 28), Sees. 48, 70 (Oxford, New York, 1932)]. 

1 M. Born, Zeils. f. Physik , 38, 803 (1926). 
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where the scattered wave v(r) is to be a small addition to the unperturbed 
plane wave solution e iki . The term v(r) will be found only to first order 
in the scattering potential F(r); the Born approximation becomes quite 
arduous when carried to higher orders. 

Substitution of (26.3) into (26.1) gives 

(_ V 2 _ k * )v = _ U(r)tP“ - U{t)v, U(i) = ^ F(r) (26.4) 

Our assumption that v(r) is small in comparison with e ikz , or the roughly 
equivalent assumption that U( r) is small in comparison with /r 2 , leads us 
to neglect the second term on the right side of Eq. (26.4). We therefore 
have to solve the inhomogeneous wave equation 

(-V 2 - *>(r) = — U(r)e ikss (26.5) 

where the right side is known. A sufficient criterion for the validity of 
our solution is 

|y(r)| <<C |e a, | = 1, for all r (26.6) 

This condition, while always sufficient, is in some cases more stringent 
than is necessary for the Born approximation to provide useful results. 

Green’s Function. Rather than discuss Eq. (26.5) as a special case, 
we indicate a method of solution of the more general inhomogeneous 
partial differential equation 

(0 - co 0 >(r) = F( r) (26.7) 

Here £1 is a Hermitian operator that defines a complete orthonormal set 
of eigenfunctions u u ( r) with real eigenvalues w, and F( r) is a known func¬ 
tion of r. 

= COU^T) 

Jw w '(r)^ w (r)dr = 8(co — w') (26.8) 

fu u (r)u u (i')dw = 8(r — r') 

It is assumed for definiteness that the eigenvalues co form a continuous set. 
Equation (26.7) can be solved by expanding ^(r) in terms of 

v(i) = J A (a u (ti {x)dco (26.9) 

Substitution into (26.7) gives 

JA w (oj — wo )w w (r)dw = F( r) 

If we multiply this equation by w w '(r) and integrate over r, we obtain 

_ JMr)F(r)* 

/ 

CO — Wq 



Sec. 26] APPROXIMATION MET HODS FOR STATION ARY PROBLEMS 161 


Thus the solution of (26.7) can be written 

#(r) = ^G m (x,t')F(x')dr' (26.10) 

where the function 

<7 W0 (r,r') = [ M “ (r)t2 “ (r - ) dco (26.11) 

J 03 — Wo 

is called the Green’s function for the operator 0 and the number wo. 

Green’s Function for a Free Particle. The Green’s function (26.11) 
can be evaluated without great difficulty when the operator £2 is the 
Hamiltonian for a free particle. A suitably normalized eigenfunction of 
—V 2 corresponding to the eigenvalue k n is from (11.11) 

Mk'(r) = (2x)~ 3 exp ik' ■ r 

where k' is any vector of magnitude k'. Thus the Green’s function is 


G k ( r,r') = (2r) 


-/ 


exp (ik 7 • r) exp (—ik' ■ r 7 ) 
k' 2 - k 2 


dr k ■ (26.12) 


We perform the k’ integration in spherical polar coordinates with the 
polar axis in the direction of the vector p = r r’. 


f * fir file ik’poimO 

Gic(r,r') = (2x)" :t / / / pTZTfi Ic "' dk ’ sin eddd<t> 

Jo Ji) Jo * K 

- (W ff 

- (4»’»)-' 


(26.13) 


where <r = kp = k |r — r / 1 is a positive number. 

The singularities in the integrand of (26.13) at k = io* can be dis¬ 
cussed in terms of the corresponding singularity of the expansion coeffi¬ 
cient A u in (26.9) at co = o> 0 . The behavior of A u at w = co 0 cannot be 
found from Kq. (26.7) alone, since to any solution t>(r) of the inhomo¬ 
geneous equation can be added a solution u m (t) of the corresponding 
homogeneous equation. The addition that is to be made can be deter¬ 
mined only from the boundary conditions that are imposed on v(r). In 
similar fashion, the contribution to the integral in (26.13) from the infini¬ 
tesimal neighborhood of the points k = ±<r can be determined only from 
comparison of Eqs. (26.3) and (26.2), which shows that we want only 
those solutions v(r) that have the asymptotic form From 

the relation (26.10) between v(r) and G k ( r,r'), we see that we must evalu¬ 
ate the integral in (26.13) in such a way that it approaches e ier when <r is 
large. 
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Explicit Evaluation. The contribution to this integral from the 
infinitesimal neighborhood of the points k = ±o- can be adjusted by 
regarding it as a contour integral in the complex k plane. The main path 
of integration is along the real axis, and we now show that the choice of 
the contour near the two singularities determines the dependence of the 
integral on o\ 

Suppose that we choose the contour as shown in Fig. 21(a). The 
integral in (26.13) can be written 




ice ,K 


da — (2 i) 




ice~ 


<t)(k + O ') 


dK (26.14) 


Imaginary 

axis 


k plane 


__ Real 
-t-axis 


(a) 


o) (* + c r) 

The first integral can be evaluated by closing the contour with an infinite 

semicircle C in the positive imaginary 
half plane as in Fig. 21 (5), since the 
exponential becomes vanishingly small 
there and contributes nothing to the 
integral. Then the value of the first 
integral is 2iri times the residue of the 
integrand at the only pole (k = <r) that 
lies within the contour: irie i<T . 

The second integral in (26.14) can be 
evaluated by closing the contour with an 
infinite semicircle C f in the negative 
imaginary half plane as in Fig 21c. It is 
equal to —2 ni times the residue at the 
only pole (k — —o ■) that lies within the 
contour: —me' 1 *. Thus the entire expres¬ 
sion (26.14) is equal to ire ia . It is not 
difficult to see that the choice of any 
contour other than that of Fig. 21(a) 
would have given a term in addition 
to or in place of the term e ltT . Such a term 
in G corresponds to an incoming wave in 
v(j) } and is ruled out because of the 
asymptotic form (26.2) of the desired 
solution. 



ation of the integral in Eq. (26.13). 


Substitution into Eq. (26.13) shows that the Green’s function for a 
free particle (represented by the operator —V 2 ) is 


G A (r,r') = (47r|r — r'j) -1 exp zA’[r — r'| (26.15) 

Scattering Cross Section. From Eqs. (26.3), (26.5), (26.10), and 
(26.15) we obtain an approximate solution of the wave equation (26.1) 
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w (r) = e ik * - (4ir) -1 /|r - r'| -1 e ik *-^e ik *U(j')di' (26.16) 

The second term in (26.16) has the form of a superposition of waves 
scattered from all points r' with amplitudes proportional to the product 
of the incident wave amplitude and the scattering potential at those 
points. 

We assume that U (r') falls off rapidly enough for large r' so that there 
is an asymptotic region in which r is large in comparison with those values 
0 f r ' for which there is a significant contribution to the integrand. We 
can then put 

l r “ r 'l-*»’ - wr', |r - r'|-‘ - + ^ 

where w is the cosine of the angle between the vectors r and r'. Thus the 
asymptotic form of (26.16) is 

u(r) —> e‘ ik3 - {^Trr)~ l ^ kr {r , )e k ^ , - wr,) d^ (26.17) 

r —+ oo 

Comparison of (26.2) and (26.17) shows that the scattered amplitude 
is 

= — (47T) -1 J U {x f )e jk ^ r ~ u ' T,) dT r 

= — (47r)“ 1 Jt/(r') exp zX ■ r'dr' (26.18) 

Here we define a vector K — k 0 — k, where k 0 is the vector of magnitude 
k that has the direction of the incident beam (polar axis), and k is the 
vector of magnitude A* that has the 
direction of the polar angles (0,</>) of 
the point at which the scattered ampli¬ 
tude is measured. Figure 22 shows 
these three vectors; the magnitude of 
K is evidently 2 k sin £-0. Its physical 
significance is that //X is the momentum 
transferred from the incident particle 
to the scattering potential during the 
collision. Thus il a Fourier integral 
analysis of the scattering potential into 
harmonic, space waves is made, it is 
apparent from the second integral in (26.18) that the scattered amplitude 
in a particular direction is proportional to the Fourier component of the 
scattering potential that corresponds to the momentum change of the 
particle during the collision. 

The differential scattering cross section is given by Eq. (18.11): 

c km = < 26 - 19 ) 



gation vectors ko for the incident par¬ 
ticle, k for the scattered particle, and 
the angle of scattering 0. The momen¬ 
tum transfer in the collision is TiK, 
which has the magnitude 2 tik sin \6. 
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In the event that JJ( r) = U(r) is spherically symmetric, the integrals 
over the polar angles of r' in (26.18) can be evaluated by taking the direc- 
lion of K as the polar axis. Then 


/(0) = —K 1 r' sin Kr' U(r')dr' (26.20) 

As expected, the scattered amplitude is independent of the angle <fi It 
is interesting to note from (26.19) and (26.20) that the amount of scatter¬ 
ing depends on the bombarding speed v and the scattering angle 6 only 
through the combination K oc v sin ^6. 

Perturbation Treatment of Partial Waves. When U(r) is spherically 
symmetric, the wave equation (26.1) can be separated in spherical coordi¬ 
nates, as was done in Sec. 19, and solved approximately by a perturbation 
method. The radial wave equation for the Ith partial wave is 


1 ^. 
r 2 dr 



1(1 + 1 ) 

r 2 



(26.21) 


* (263 >’ « P" 1 -.)« + xi(r), where ji(tcr) given by 

Eq. (15.5) is the unperturbed solution. The approximate equation for 
XI is found from (26.21) to be 


j* 2 dr 



1(1 ± 1) 

r- 


Xi = U(r)ji(kr ) 


(26.22) 


where a term U(r) X i(r) has been neglected. 

Equation (26.22) is an inhomogeneous total differential equation of 
which the right side is known. The solution is most readily obtained by 
putting xi(r) - vi(r)j,(kr), in which case Eq. (26.22) can be written 


■ dr = r*fi(kr)U(r) (26.23) 

Equation (26.23) can be integrated to give 


dv f 

r'jKkr) £ = J Q r-j\{kr) U(r)dr + C (26.24) 

where the constant C of integration must be chosen equal to zero since 
dvi/dr is finite at r = 0. 

The asymptotic form of (26.24) can be obtained if the integral on 
the right converges with infinite upper limit. Because of the asymptotic 
lorm (15.8) of Jh this integral converges when the Born approximation 
integral (26.20) converges; i.e., when U(r) falls 0 ff more rapidly than 
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1 /r. In this case the asymptotic form of dvi/dr is 

sec 2 [Jr - i(Z + l)x] J q r 2 jf(kr)U(r)dr (26.25) 
Integration of (26.25) gives 

Vi(r) -> k tan [kr — %(l + l)x] f ” r 2 j\(kr)U(r)dr + C 

from which we obtain 

Ri(r) -> (kr )- 1 {cos [kr — %(l + l)x](l + C') 

r—± oo 

+ sin [kr - i(Z + l)x] jj kr 2 j!(kr)U(r)dr} (26.26) 

Phase Shifts, xi was originally assumed to be small in comparison 
with ji } so that the constant C' of integration is small compared to unity. 
The phase shift 5i is defined by the asymptotic form 

(At ) -1 cos [At — £(l + IV + h] 

for Ri given in (19.8). It is also small, and from (26.26) is given approxi¬ 
mately by 

Si K ~ J Q “ kr°if (kr) U (r)dr (26.27) 

Equation (26.27) is the perturbation or Born approximation expres¬ 
sion for the phase shifts. If all these <5z are actually small, they can be 
substituted into Eq. (19.11) for f( 0 ) with the approximation 

e 2iSl - 1 ^ 2 ib h 

in which case 

oo 

f( 0 ) c* k ~ 1 £ (21 + l)5iPi(cos 6 ) 

/ = 0 

oo 

= - f°° r 2 U(r) [ ^ (2Z + l)fl(kr)P t (cos 0)] dr (26.28) 

Z = 0 

The. summation in brackets can be shown 1 to be equal to sin Kr/Kr, 
where K = 2k sin ^0, so that (26.28) is equal to the Born approximation 
amplitude (26.20), as expected. 

The foregoing perturbation treatment of partial waves is of practical 
interest since it has been found in some cases that substitution of the 
approximate phase shifts (26.27) into the exact expression (19.11) for 

1 G. N. Watson, “Theory of Bessel Functions,” 2d ed. p. 366 (Macmillan, New 
York, 1945). 
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the scattered amplitude results in an improvement on the Bom approxi¬ 
mation amplitude (26.20) when the 81 are not all small in comparison with 
unity. In general, it is much easier to evaluate the integral in (26.27) 
than to find the phase shift exactly from a solution of the radial wave 



y 

Fig. 23. (a) Angle distribution function for scattering by a square-well potential, as given 

by Eq. (26.29); (6) total cross section function given by Eq. (20.30). 

equation. Also, the Born approximation amplitude (26.20) can be used 
as a device for summing the partial wave series for large l where the 5i are 
small; suitable modifications can then be made for the low terms of the 
series. 

Scattering by a Square Well Potential. As a first example of the 
application of the Born approximation result (26.20), we consider the 
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scattering by a square well potential: V(r) = — Vo, r < a, V(r) — 0, 
r > a. Substitution into (26.20) gives 

f(6) = (sin Ka - Ka cos Ka), K = 2 k sin id 


Thus the differential scattering cross section is 

- <Si ° ^ ~J C ° SJ) ’ 06.29) 

The function g(x)/g( 0) = 9 g(x) is plotted in Fig. 23a. At high energies 
(ka » 1 ), the scattering shows a strong maximum in the forward direc¬ 
tion so that most of the scattered particles are in a cone whose angular 
opening is of order 1 /ka. 

The total cross section is most easily evaluated by changing the 
variable of integration from 0 to x = Ka = 2 ka sin id, in which case 
sin Odd is replaced by xdx/(ka)-. Equation (26.29) then gives 


a 


y(y) 




2 x 

(ka)' 


r2ka 

/ g(x)xdx 

J o 


32t rM 2 V?,a 6 
h A 


7 (2/ra) 


1 f v (sin x — x cos x) 2 7 

¥ Jo * 

J_/ _ 1 , sin 2 y _ sin 2 y \ 

4 ?/ 2 \ ?y- y A y 4 / 


(26.30) 


Since 7 ( 0 )=^ and 7 ( 7 /) approaches l/4z / 2 as 7/ becomes large, the total 
cross section is Hbrin the limit ka 1 and becomes 


iruVW 

h 2 E 


when the bombarding energy E measured in the center-of-mass system is 
large. The function 7 (?y)/y(0) = I 87 (?/) plotted in Fig. 236. 

Validity of the Born Approximation. A convenient criterion for the 
validity of the Born approximation as applied to the foregoing problem 
can be set up by using lOq. (20.6) and assuming that ^(r) is largest at the 
center of the scattering potential. This condition is probably sufficient, 
but may be more stringent than is actually required; for exam pie, the small- 
angle scattering (small momentum transfer) may be given correctly 
when the large-angle scattering is not. From (26.16), our criterion is that) 
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K°) I = ^ |/ q y_ i e ik ^+-W{r)rdrdw 

= WklJo’ ^ ~ ^ V ^ dr 

~ 2PP \ e2lk * ~ 2 ^ a — 1 1 

= Wh - 2 V sin V + 2 - 2 cos « 1 , y = 2 A-a (26.31) 

.In low-energy limit (*a « 1), (26.31) becomes M F 0 a 2 A 2 « 1, 
while m the high-energy limit (ka » 1 ) it becomes 

_ Foa ,, 1 
h 2 k hv ’ 

where v is the speed of the incident particle. If the square well potential 
is about ‘strong” enough to bind a particle (as shown in Sec. 15 this 
requires that /»7oa*/A* « 1 ), the Born approximation may therefore be 
used for the computation of the scattering only when ka» 1 . Thus the 
Born approximation supplements the method of partial waves (Sec. 19), 
which is most useful when ka is less than or of order unity. 

The qualitative features of the results obtained here and above for a 

square well potential apply to any potential that possesses a well-defined 
range. 

Scattering by a Screened Coulomb Field. As a second example of the 
application of the Born approximation, we consider the elastic scattering 
of an electron by a neutral atom that is represented by a simple form 

of screened Coulomb potential : 1 F(r) = ~(Ze */r)e"«. This potential 
energy behaves like the nuclear Coulomb potential for atomic number Z 
when r is small, and falls off rapidly when r is large in comparison with the 
radius” a of the atomic electron cloud that screens the nucleus. The 
Thomas-Fermi statistical theory of the atom (see Sec. 38) shows that for 
moderately heavy atoms, a is roughly equal to h*/me*ZK where m is the 
electron mass. 

Substitution of this potential into (26.20) gives 

a\ 2 mZe 2 f °° . -Z 2 mZe 2 

m = ~mr Jo sm Kr e ° dr = W { K> + O ’ K = 2* Sin ie (26.32) 

* *' or , a more detailed discussion of this problem, see N. F. Mott and H. S. W. 
Massey, 'The Theory of Atomic Collisions,” Chap. IX, Secs. 3, 4 (Oxford, New York, 
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This gives a cross section in agreement with the Rutherford result (20.11) 
when, the momentum transfer is large enough so that 1 /a 2 can be neglected 
in comparison with K 2 in the denominator; in the analogous classical 
situation, the particle passes close to the nucleus, so that the screening 
electrons are relatively ineffective. Equation (26.32), unlike the Ruther¬ 
ford result, yields a finite cross section at vanishingly small angles; the 
analogous classical particles pass far from the nucleus and are well 
screened from it by the atomic electrons. The total cross section is 


a 



2 ttKcIK 

F~" 


I67rm 2 ZVa 4 
h\4k 2 a 2 + 1) 


With the above Thomas-Fermi expression for a , this becomes AirZi/k 2 
at high energies (ka » 1), which agrees in order of magnitude with the 
result of a numerical integration 1 of the scattering produced by the 
Thomas-Fermi potential. 

The criterion (26.31) for the validity of the Born approximation 
becomes 


2 mZe 2 

~wr 


L 


tX—r— dX\ 

sin x e ka — 
x 


« 1 


where x replaces kr as the variable ot integration. For ka <3C 1, this 
becomes 2 mZcM/fi 1 « 1, which with the above approximate expression 
for a is equivalent to Z* « 1; thus the Born approximation cannot be 
used for the scattering of slow electrons by atoms. For &a» 1, the 
criterion becomes (Zc 2 /hv) In ka <3C 1. Since it turns out that this result 
is substantially unaffected by relativity theory the approximation 
becomes poor for the heavier elements, where 


Ze 2 

he 137 


becomes comparable with unity. 

It is interesting to note the close correspondence between the various 
results for the square well potential and tor the screened Coulomb field 
when a is chosen the same in the two cases and Vo ~ Ze 2 /a. 


27. THE VARIATION METHOD 

The variation method has been used mainly for the approximate 
determination of the lowest or ground-state energy level of a system. 2 
It can be used when there is no closely related problem that is capable of 

1 E. 0. Hullard and H. S. W. Massey, Proc. Camb. Phil. Soc., 26, 556 (1930). 

2 It has also been applied recently to collision problems by J. Schwinger, Phys. 
Rev., 72, 742 (1947). 
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exact solution, so that the perturbation method is inapplicable. The 
variation method can also be applied to systems that are described by a 
nonseparable Schrodinger equation, in which case numerical solutions are 
extremely arduous and the WKB method (Sec. 28) cannot be used. 

Expectation Value of the Energy. It was shown in Sec. 10 that if an 
arbitrary normalized function \f/ is expanded in energy eigenfunctions 

\p = ) A e ue , where Hu E = Eue (27.1) 

E 

and the u E form a complete orthonormal set, then the expectation value of 
H for the function ^ is given by 

(H) = J Wdr = l E\A,\ 2 (27.2) 

E 


where the integration is extended over the entire range of all the coordi¬ 
nates of the system. It is assumed for convenience in Eqs. (27.1) and 

(27.2) that the energy eigenvalues are all discrete; this can be accom¬ 
plished by enclosing the system in a box (Sec. 10), or else the summation 
can be replaced by the symbol $ (Sec. 22). 

A useful inequality can be derived from Eq. (27.2) by replacing each 
eigenvalue E in the summation on the right side by the lowest eigenvalue 
Eq'. 

(H) St ^ E Q \A^ = E 0 £ \A»\* (27.3) 

E E 

Since ^ \Ae\ 2 = 1 for a normalized function as was shown in Sec. 10, 

E 

(27.3) yields the inequality 

E 0 ^ WH^dr (27.4) 


In the event that is not normalized, (27.4) evidently can be rewritten as 


E o 


< WH+dr 

= M 2 dr 


(27.5) 


The variation method 1 consists in evaluating the integrals on the 
right side of (27.4) or (27.5) with a trial function \p that depends on a 
number of parameters, and varying these parameters until the expecta¬ 
tion value of the energy is a minimum. The result is an upper limit for 
the ground-state energy of the system, which is likely to be close if the 

1 The method was originally applied by Lord Rayleigh in 1873 to the computa¬ 
tion of the vibration frequencies of mechanical systems—“Theory of Sound/’ 2d rev. 
ed., vol. 1, Sec. 88 (Macmillan, London, 1937; reprinted by Dover, New York). 
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form of the trial function resembles that of the eigenfunction (see Prob. 9). 
Thus it is important to make use of any available information or physical 
intuition in choosing the trial function. 1 

Application to Excited States. The variation method can also be 
used to obtain an upper limit for one of the higher energy levels if the 
trial function is orthogonal to the eigenfunctions of all the lower states. 
Suppose that the energy levels are arranged in an ascending series: 
Eo, Ei, E 2 , . . . - Then if p is orthogonal to u Ei for i = 0, 1, . . . , n, 
it is easily seen from (27.1) that the corresponding expansion coefficients 
A Ei are all zero. An inequality can be derived from (27.2) by replacing 
each eigenvalue E in the summation on the right by En+i, with the 
result that the expectation value of the energy is an upper limit on this 
eigenvalue. 

The trial function p — u EQ fuE 0 pdr is evidently orthogonal to u Ea , so 
that if the lowest eigenfunction is known either from an exact solution or 
to a sufficiently good approximation from a variation calculation, an 
upper limit for the energy of the first excited state can be computed. 
Trial functions that are orthogonal to any number of known eigenfunc¬ 
tions are easily found in this way. 

It is sometimes possible to divide the energy eigenfunctions into 
groups such that any member of one group is orthogonal to any member 
of any other group. Suppose that there is a Hermitian operator F that 
commutes with H (FII — HF = 0); then from a theorem of Sec. 21, F 
and H can be diagonalized simultaneously and have common eigenfunc¬ 
tions. Now any two eigenfunctions ot F that correspond to different 
eigenvalues are orthogonal to each other. 2 Thus a trial function that is 
constructed entirely from eigeniunctions of F that correspond to a given 
eigenvalue is orthogonal to all other eigenfunctions that correspond to 
different eigenvalues of F, and will provide an upper limit for the lowest 
energy eigenvalue that is associated with this eigenvalue of F . The fore¬ 
going results arc useful when the operator F is one whose eigenfunctions 
are easily recognizable by some simple property, such as, for example, the 
symmetry in case F is the angular momentum or the parity, then a 
trial function with angular dependence corresponding to a particular 
angular momentum, or with a particular parity, can easily be written 
down, and gives an upper limit for the lowest energy level that has this 
angular momentum or parity. 


1 For an extension of this method that gives both upper and lower limits, see D. II. 
Weinstein, Proc. Nat. Acad. SrL, 20, 520 (1034). 

2 This is shown explicitly in Fq. (10.4) for the energy operator, and the proof given 
there is easily extended to any Hermitian operator. 
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Ground State of Helium. As a first example, we use the variation 
method with a simple trial function to obtain an upper limit for the energy 
of the ground state of the helium atom. The helium atom consists of a 
nucleus of charge +2e surrounded by two electrons; from Eq. (16.1) 
we find that its Hamiltonian is (neglecting the motion of the nucleus) 


+ + + (27 - 6) 


where r x and r 2 are the position vectors of the two electrons with respect 
to the nucleus as origin, and r X2 = |r x — r 2 | is the distance between the 
two electrons. 

If the interaction energy e 2 /r X2 between the two electrons were not 
present, the ground-state eigenfunction of H would be the product of 
two normalized hydrogenic wave functions ^ioo(ri)^ioo(r 2 ) given in Eq. 
(16.24): 


lK*i,r*) 


Z Z -('^')(ri+r2) 
—* e ' ao * 

7 T a\ 


(27.7) 


with Z = 2. We shall use (27.7) as a trial function, and permit Z to be 
the variation parameter so that it is not necessarily equal to 2. 

It follows from Prob. 13, Chap. IV, that the expectation values of the 
kinetic and potential energies for the ground state of a hydrogen atom are 
e 2 /2a 0 and — e 2 /a 0 , respectively; the corresponding hydrogen wave func- 

r 

tion is (7rajj)~ *e ao . The expectation value of either of the kinetic energy 
operators in (27.6) for the function (27.7) is obtained most easily by not¬ 
ing that operation with the Laplacian gives a result that is inversely 
proportional to the square of the length scale of the wave function; since 
the scale of (27.7) is smaller than that of the hydrogen wave function by a 
factor of Z, the expectation value of each of the kinetic energy operators 
is e 2 Z 2 /2a 0 . Similarly, the factors 1/r make the expectation values of the 
nuclear potential energy operators inversely proportional to the length 
scale; there is also an additional factor 2 from the nuclear charge, so that 
each one is — 2e 2 Z/a 0 . 

Electron Interaction Energy. The expectation value of the inter¬ 
action energy between the electrons is 

J j ^(ri,r 2 ) ^(ri,r 2 )dridr 2 

-Q (27 - s) 
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This integral is most easily evaluated by regarding it as the mutual elec¬ 
trostatic energy of two overlapping, spherically symmetric, charge dis¬ 
tributions, in which case simplifications from the theory of electrostatics 
can be introduced. 

A more general way of performing the integration, which can also be 
used for wave functions that are not spherically symmetric, consists in 
expanding l/ri 2 in spherical harmonics. 


?*12 


eo 



Pi (cos d), 



ri > r 2 


n < r t 


(27.9) 


where 9 is the angle between ri and r 2 , cos 6 = cos 6 1 cos 0 2 + sin 6 1 • 
sin cos (4>i — 4 >i), and 6 h 4>i and 0 2 ,</>2 are the polar angles of the vectors 
ri and r 2 , respectively. 1 It can be shown 2 that 
Pi(cos 9) = Pi (cos 0i)Pi(cos 0 2 ) 

i 

2 X ['!'+ " mfl p r(cos 0i)Pf(cos 6-i) cos - fa) (27.10) 

m — 1 

When (27.9) and (27.10) are substituted into (27.8) and use is made 
of the orthogonality of the spherical harmonics, the integration over the 
polar angles of ri causes all terms to vanish except that for which l and 
m are zero. The integral on the right side of (27.8) becomes 



2 z 

a 0 


(i'1+r- 1 ) 


r\dri + £ 


1-(ri-fra) 

— e a “ 
r 2 



r\dr\ 


which can be evaluated as 5T 2 a§/8Z ft . Thus the electron interaction 
energy has the expectation value 5c 2 Z/8a 0 . 

Variation of the Parameter Z. We now have the result that the 
expectation value of the Hamiltonian (27.6) for the trial function (27.7) 
is 



4cJZ 5 eJZ 

do 8a 0 


= / (Z 2 - v-Z) 

a o 


1 Equations (27.9) follow at once from the generating function (14.10) for the 
Legendre polynomials; the expression for eos 0 is simply obtained from the scalar 
product of the vectors r! and r-j in rectangular coordinates. 

2 E. T. Whittaker and G. N. Watson, “ A Course of Modern Analysis/’ 4th ed., 
p. 328 (Cambridge, London, 1935). 
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Differentiation with respect to Z shows that this is a minimum when 
Z = f-g- = 1.69. Thus the lowest upper limit for the ground-state energy 
of the helium atom obtainable with this trial function is 


■(H) 2 « = 

do 


-2.85- 

CLq 


The experimental value for the energy required to just remove both 
electrons from a helium atom is 2.904e 2 /ao, so that our limit is about 1.9 
per cent high. The most careful variation calculation of the ground-state 
energy of helium gives a result in excellent agreement with experiment, 1 
and provides an important verification of the theory of quantum 
mechanics. 

The result that hydrogenic wave functions give the best energy value 
when Z = H rather than 2 indicates that each electron screens the 



Fig. 24. Two hydrogen atoms, with nuclei at A and B separated by a distance R, have 
electrons at 1 and 2; their interaction is given by H' in Eq. (27.11). 

nucleus from the other electron, the effective nuclear charge being reduced 
by tf of an electronic charge. 

If the electron interaction term e 2 /ri 2 is regarded as a perturbation, 
the first-order perturbation energy is given by (IT) with Z = 2 and is 
— 2.75e 2 /ao, which is 5.3 per cent above the experimental value. It 
is apparent. that, in general, the first-order perturbation calculation is 
equivalent to a nonoptimal variation calculation. 

van der Waals Interaction. As our second example of the application 
of the variation method, we calculate the van der Waals (long-range) 
interaction between two hydrogen atoms in their ground states. It is 
convenient to consider this problem first by means of the perturbation 
theory, since it is then easier to see that the leading term in the energy at 
great separation distances varies inversely as the sixth power of this 
distance. Also, it turns out that the perturbation theory and the varia¬ 
tion method provide opposite limits for the coefficient of this term. 


1 E. A. Hyllcraas, Zeits. f. Physik , 65, 209 (1930). The host value obtained by 
Hylleraas is slightly lower than the experimental value, contrary to theoretical expecta¬ 
tion; the discrepancy must be attributed to a numerical error in the calculation, to 
experimental error, to some small interaction energy that was not taken into account, 
or to an inadequacy of the theory [see H. A. S. Eriksson, Nature, 161, 393 (1948)1. 
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We assume that the nuclei of the two hydrogen atoms, A and B, are 
fixed in space a distance R apart, and that the z axis is chosen parallel to 
the line through A and B. Then if ri is the vector displacement of elec¬ 
tron 1 from nucleus A and r 2 is the vector displacement of electron 2 from 
nucleus B (see Fig. 24), the Hamiltonian for the two electrons can be 
written 


Ho = 
H' = 


H = Ho + H' 

h 2 p 2 

- IL (v? + v|) - - 

2 m v 1 2/ ri 

t_ . £_ - il _ £. 

R T 12 Y IB 7*2 A 


T 2 


(27.11) 


The unperturbed Hamiltonian Ho has the solution 
wo(ri,r 2 ) = 'Uioo(ri)wioo(r 2 ) 


for two noninteracting hydrogen atoms in their ground states. We 
regard the interaction terms H' as a perturbation; this is equivalent to 
assuming that R a Q . 

Since we are interested in the leading term in the interaction energy 
when R is large, we expand H' in powers of l/R and keep the lowest 
terms. 


H f 


_ £? \ 
R 1 


1 + 


1 + 


2 (z 2 — 21) , Qr 2 ~ %i ) 2 + (?y2 — y i) 2 + (^2 — gQ 


- 


R 


i 

-('-tt + A) ‘-( 1 +w + £) 1 


= (^2 + 2 / 12/2 ~ 2 Z ! Z 2 ) 


(27.12) 


The last term is the interaction energy of two electric dipoles that cor¬ 
respond to the instantaneous configurations of the two atoms. 1 

It is apparent at once that the expectation value of the leading term 
in Ii' for the state Wo(ri,r 2 ) is zero, since Wo is an even function of ri and 
r 2 and II' is an odd function of ri and r 2 separately. It can also be shown 
that all the neglected higher terms in //' have zero expectation value for 
Wo, since these terms can be expressed as spherical harmonics of order 
different from zero. Thus the leading term in the interaction energy is 
the second-order perturbation of the dipole-dipole term, which is pro¬ 
portional to Ii'- and hence varies like 1/A ><5 . 


1 The neglected terms in the expansion (27.12) that vary like l /R* are the dipole- 
quadrupole interaction; the \/H 6 terms are the. quadrupole-quadrupole interaction, 
etc. 
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Perturbation Calculation. From Eq. (25.12), the second-order 
change in the energy of the two hydrogen atoms is 

W(R) = S' (27.13) 

where the index n refers to all states of the pair of unperturbed hydrogen 
atoms (including dissociated states), and the ground state u 0 is excluded 
from the summation and integration that is denoted by S* If is appar¬ 
ent that W(R) is negative, since Eq < E n and the numerator of each term 
in (27.13) is positive. We thus conclude that the interaction is attractive 
and proportional to 1 /R & , when R is large; both these conclusions can be 
shown to be valid for any pair of atoms that are in nondegenerate, 
spherically symmetric ground states. 

We can obtain an upper limit on the positive quantity —W{R) by 
replacing each E n in (27.13) by the energy E n * of the lowest excited state 
of the two hydrogen atoms for which H ' 0 «* is different from zero. Then 
the denominator can be taken outside of the summation, which can be 
evaluated as a matrix product 

SX »| 2 = S'H'A = S«0 - tf'oo = (ff'% - 
Since we have seen that H' 00 = 0, we have that 

-W(R)zAE?h «. (27.14) 

The state n* is that in which both atoms are excited to states of princi¬ 
pal quantum number 2, so that E 0 = — 2(e 2 /2a 0 ), E n * = — 2(6 2 /8a 0 ), 
and E n * — Eq = 3e 2 /4ao. From (27.12) we have 

H n = ^ (x\x\ + y\y\ + Az\z\ + 2x 1 x 2 yiy-> - ■ ■ ■ ) (27.15) 

The expectation value of the cross-product terms like xix^yiy^i is zero 
since these terms are odd functions of one of the cartesian components of 
ri or r 2 . The first three terms in the parenthesis of (27.15) are each the 
product of two identical factors that are equal to 

J x 2 |wioo(r)| 2 dT = i J r 2 |Mioo(r)| 2 dr 
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so that (if 72 )oo = 6e 4 aj/i? 6 . Substitution into (27.14) gives 


W(R) ^ - 2139 (27.16) 

Variation Calculation. An upper limit on W(22) can always be 
obtained by the variation method. It is apparent, however, that some 
judgment must be used in the choice of the trial function thus if ^ 
does not depend on R, the dependence of the expectation value of the 
energy on R will be like that of //', that is, 1/i? 3 . An upper limit with 
this R dependence is of no value to us, since what we really want to 
determine is a limit on the coefficient of the 1 /R 6 interaction. A useful 
choice for \p will be one in which there is a term proportional to H since 
there will then be terms in the expectation value that are proportional to 
H' 2 and hence vary like 1/B 6 . 

We choose for the trial function 

iA(ri,r 2 ) = Uioo(ri)uioo(r2)(l -h AH') 

where A is to be the variation parameter. Since this \p is not normalized, 
we use (27.5) rather than (27.4) and obtain 

w ^ J>o(l + AH')(H n + ff>„( 1 + AH')dr x dr 2 

E ° + W{h) * -7JW1 + AH')*drWi { ) 

where again u 0 is the product of the ground-state hydrogen wave func¬ 
tions, and A is assumed to be real. The right side of (27.17) can be 
written 

Eu + 2 A(//'*)do + AKH'HoH'U /vt , Q , 

-_ r+ - (27.18) 

since Uq is a normalized eigenfunction of H 0 with the eigenvalue 


n c- 

A<» --» 

and H' 00 = (//' 3 )no = 0. It is easily seen that (H'HqH')m is a sum of 
squares of factors of the form Jwioo(r) x H 0 x Mioo(r)dr; this can be shown 
by direct computation to be zero. 

Since we are interested only in terms of order II' 1 , we expand the 
denominator of (27.18): 

[E Q + 2A(H'*U][l + 7! *(// /a )«,ol- 1 ^ Eo + (// ,2 )oo(2A - E {) A*) (27.19) 

If we remember that E o is negative, we find that (27.19) has a minimum 
with respect to variation of A when A = l/E 0 , in which case (27.17) 
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becomes 

Eo + W(R) + = Eo - (27.20) 


Thus in (27.16) and (27.20) we have both upper and lower limits on the 
interaction energy 


8 e 2 * a 5 Q 


^ W(R) ^ - 


6 e*al 
R G 


More careful variation calculations have shown that the numerical coeffi¬ 
cient in W(R) is very nearly 6.50. 1 

28. THE WKB APPROXIMATION 

In the development of quantum mechanics, the Bohr-Sommerfeld 
quantization rules of the old quantum theory (Sec. 2) occupy a position 
intermediate between classical and quantum mechanics. It is interesting 
that there is a method for the approximate treatment of the Schrodinger 
wave equation that shows its connection with the quantization rules. It 
is based on an expansion of the wave function in powers of h, which, 
while of a semiconvergent or asymptotic character, is nevertheless also 
useful for the approximate solution of quantum-mechanical problems in 
appropriate cases. This method is called the Wentzel-Kramers-Brillouin 
or WKB approximation , although the mathematical technique seems to 
have been first introduced by Rayleigh in another connection. 2 It is 
applicable to situations in which the wave equation can be separated 
into one or more total differential equations, each of which involves a 
single independent variable. 

Classical Limit. A solution ^(r ,t) of the Schrodinger wave equation 
(6.16) 

can be written in the form 

,/ .v A iW(rJ) 

>Kr ,t) = A exp — ; 


1 See L. Pauling and E. B. Wilson, Jr., “Introduction to Quantum Mechanics,” 
Sec. 47a (McGraw-Hill, New York, 1935). 

2 It is sometimes called the BWK method , the classical approximation , or the phase 
integral method. For the original work, see Lord Rayleigh, Proc. Hoy. Soc. A86, 207 

(1912); H. Jeffreys, Proc. London Math. Soc. (2), 23, 428 (1923); G. Wcntzel, Zeits. f. 
Physik., 38, 518 (1926); H. A. Kramers, Zeits. f. Physik., 39, 828 (1926); L. Brillouin, 
Comptes Rendus, 183, 24 (1926). For more recent developments, see TO. C. Kemble, 
“The Fundamental Principles of Quantum Mechanics,” Sec. 21 (McGraw-Hill, New 
York, 1937); R. E. Langer, Phys. Rev., 51, 669 (1937); W. H. Furry, Phys. Rev., 71, 

360 (1947). The treatment of this section resembles most closely those of Kramers 

and Langer. 
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in which case IF satisfies the equation 

^ + i (g radHT + F-*|v<H'-0 (28.1) 

In the classical limit (A -* 0), Eq. (28.1) is the same as Hamilton’s partial 
differential equation for the principal function W: l 

— + H( r,p) =0, p = grad W 

Since the momentum of the particle is the gradient of W, the possible 
trajectories are orthogonal to the surfaces of constant W and hence, in 
the classical limit, to the surfaces of constant phase of the wave function 
Thus in this limit the rays associated with \p (orthogonal trajectories 
to the surfaces of constant phase) are the possible paths of the classical 
particle. 

iEt 

If \// is an energy eigenfunction u(r)e h , W can be written 

= S(r) - Et 

In this case, we have that 

«(r) = A exp ^ r) i L (grad S)* - [E - F(r)] - | V»S = 0 (28.2) 

The WKB method obtains the first two terms (one term beyond the clas¬ 
sical expression) of an expansion of S in powers of A, in the one-dimensional 
case. 

Approximate Solutions. The basic equation that we consider is 
written in one of the forms 

~ + k-(-r)u = 0, lc- > 0 (28.3) 

~~ - K-(x)u = 0, K- > 0 (28.4) 

so that k and k are always real. These tire equivalent to the one-dimen¬ 
sional wave equation (8.5), if we put 

Hz) = + j mil - F(.r)]]> when Ffcr) < E 

(28.5) 

k(x) = +j t \2n[V(r) - E]\i when F(;c) > E 

1 E. T. Whittaker. “Analytical Dynamics,” 3d cd., See. 142 (Cambridge, London, 
1927). 
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Equations (28.3) and (28.4) are also equivalent to the radial wave 
equation (19.2) if x is replaced by r, V (: r ) is replaced by 

VI r) + 

K 1 ^ 2/j.r 2 

and u is equal to r times the radial wave function. 

We restrict our attention for the present to Eq. (28.3); we shall be 
able to generalize the resulting expression for u(x) to obtain solutions of 
(28.4). We put 

iS(x) 

u(x) = Ae h 

which on substitution into (28.3) gives the one-dimensional form of 
(28.2) 

ihS" - S' 2 + h*k* = 0 (28.6) 

where primes denote differentiation with respect to x. 

We substitute an expansion of S in powers of h into (28.6) and equate 
equal powers of h. 

S = $o H“ hSi + ■ • • 

-S'* + 2 P {E - V) = 0 

iSS - 2 S' 0 Si = 0, etc. 

Integration of these equations gives 

S 0 {x) = ±h J x k(x’)dx', 8 i(x) = H In k(x), 

where arbitrary constants of integration that can be absorbed in the 
coefficient A have been omitted. We thus obtain to this order of 
approximation 

u(x) = Ak -1 exp (±» f x kdx), V < E (28.7) 

In similar fashion, the approximate solution of (28.4) is 

u(x) = Bk- i exp (± J x Kdx), V > B (28.8) 

Asymptotic Nature of the Solutions. The accuracy of these WKB 
solutions can be gauged by comparing the magnitudes of the successive 
terms So and hSi in the series for 5. Since So is a monotonic increasing 

hS 

function of x so long as k does not vanish, the ratio is small if 

aSu 

hgf 

-qt- is small. We thus expect (28.7) to be useful in that part of the 
o 0 

domain of x where 


hS[ 


V 

S'o 


2k 2 


«1 


(28.9) 
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Now the local de Broglie wave length X is 2ir/k, so that (28.9) can be 
written 


X* dk 
47 r dx 


« k 


which means that the fractional change in k (or in the wave length) in 
the distance X/47T is small compared to unity. Thus the WKB solutions 
are useful when the potential energy changes so slowly that the momen¬ 
tum of the particle is sensibly constant over many wave lengths. 

The same criterion is obtained for (28.8) if we now mean by the “wave 
length” the distance in which the magnitude of u changes by a factor 
e 2t . 

It is apparent that the condition (28.9) is violated near the turning 
points of the classical motion, where V(x) = E, k and k are zero, and 
the “wave length” is infinite. Thus the solutions (28.7) and (28.8) are 
asymptotically valid in the sense that they can be used several wave 
lengths from the nearest turning point if, as is usually the case, the wave 
length is there slowly varying. 

The asymptotic solutions are of little use to us unless we know how to 
connect an oscillating solution like (28.7) to an exponential solution like 
(28.8) across a turning point. It is only in this way, for example, that 
we can apply boundary conditions and obtain energy eigenvalues. The 
derivation of such connection formulas , which we consider next, is the 
central problem of the WKB approximation. 

Solution near a Turning Point. The wave equations (28.3) and (28.4) 
are regular at a turning point, so that there is a solution that is analytic 
there and has asymptotic forms like (28.7) and (28.8). Such a solution 
usually cannot be written down in closed form. The wave equation can, 
however, be modified slightly so that an exact solution that has the desired 
asymptotic forms can be obtained. 

We can without loss of generality take the origin of # at a particular 
turning point; we also assume for the moment that V(x) < E to the right 

of the turning point (positive x ), and put £(x) = J q kdx. Now if 

k 2 (x) = Cx n , where C is a positive constant, Eq. (28.3) is known to have 
the solutions 

u(x) = Apk-U ± M), m = (28.10) 


where J is a Bessel function; this can be verified by direct substitution. 
The asymptotic form of J is such (see below) that (28.10) agrees asymp¬ 
totically with (28.7). 

We therefore try to retain this form by rewriting (28.3) with an 
additional term 0 (x ): 
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g + (** - 6)u = 0 (28.11) 

Substitution of (28.10) into (28.11) shows that the new equation is 
satisfied if we define 8 as 

yt L2 

e ^^w-2k + ( ~ m "-^e (28,12) 

We expand A* 2 as a power series in x: 

k 2 (x) = Cx n (l + ax + bx 2 + • • • ) 

in which case 8 can also be expanded in a series. The l/x 2 and l/x terms 
vanish, and the leading term is independent of x. 


6 (x) — 


_ 3 (n + 5)a 2 

~o, 2(n + 4) (n + 6 ) 


(28.13) 



Fig. 25. A typical linear turning point, where V(x ) = K at x = 0; in region 1, E > V(x ), 
and in region 2 , E < V(x). 


We can now see that (28.11) is a good approximation to the actual 
wave equation (28.3). The similarity in structure between each of the 
three terms in (28.12) and the asymptotic accuracy criterion (28.9) 
indicates that 6 « k 2 in the asymptotic region if the WKB method can 
be used at all. At and near the turning point, 6 is not negligible in com¬ 
parison with A* 2 , since 8 is a constant and k 2 vanishes at x = 0. However, 
(28.13) shows that 0 ( 0 ) is quite small, being of second order in the devia¬ 
tion of k 2 from the simple form Cx n . Thus for potential functions V(x) 
that are slowly varying, (28.10) is expected to be a good approximation to 
the actual solution of equation (28.3). 

Linear Turning Point. We now specialize to the situation of greatest 
physical interest, in which n — 1 . A typical linear turning point is 
shown in Fig. 25; Eq. (28.3) is used in region 1 (x > 0), and Eq. (28.4) in 

region 2 (x < 0 ). We put J* kdx, % 2 = f ° xdx 3 so that both £i 

and £ 2 increase as x moves away from the turning point; this makes it 
easy to generalize the results to situations in which the regions 1 and 2 
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are interchanged, 
regions are 


The two independent solutions in each of the two 


«!*(*) = 
«!*(*) = 


(28.14) 


It is evident that we must replace J by 7, the Bessel function of imaginary 
argument, in region 2. 

We require the leading terms of the power series expansions and of 
the asymptotic expansions for these functions: 1 


'Tfci(£i) 


(£h ) ±5 




*-.0 r(i ± i) 

- * cos 

E—> 00 

, (**.)* 


(*■* 5-0 


.->0 r(i ± *) 

-> (2 x| 2 )- l[c«> + ert> ■ e -U±P«] 


(28.15) 


It is important to note that the term in the asymptotic expansion 
for I can be retained only when a combination of solutions 7 ± j is chosen 
such that the coefficient of e«* is zero. This is because other terms in the 
asymptotic expansion, such as e£-/£ 2 , have been neglected, and these are 
of larger order of magnitude than <rf«. The asymptotic nature of the 
WKB approximation is such that if the term that increases exponentially 
away from the turning point is present, it is impossible to say whether or 
not the decreasing exponential term is also there. 

Connection at the Turning Point. The loading term in k 2 at x = 0 
is Cx, so that k S r.S, k £* {1 S (2n/3)j-», fe £* (2c/3)|*|«, where 

c = + Then from (28.14) and (28.15) we obtain the behavior of the 

w’s near x = 0 


"7 = A + 
i't = 1 >\ 


(*)»(*<■) 
i'(i) ’ 

(S)'(s'c) » 1 . 

r<i) w 


v -^n (mw-j 
2 ~ r(t) 


It is apparent then that uf joins smoothly on to u$ if B + = — A+, and 
that i/y joins smoothly on to if B_ = d_. 

These relations between the coefficients ean be used to obtain asymp¬ 
totic, forms like (28.7) and (28.8) for the two independent solutions u + 
and nr in the two regions (the arbitrary multiplying constants A ± are 
omitted). 


1 Whittaker and Watson, op. rit ('hup. 17. 
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The asymptotic forms of any linear combination of u + and vr can be 
found from Eqs. (28.16). 

Asymptotic Connection Formulas. Convenient connection formulas 
between the asymptotic WKB solutions in the two regions can be 
obtained by choosing suitable linear combinations of u + and vr, Thus 
the combination u + + vr contains only the decreasing exponential, and 
yields the first connection formula 


cos (Ji — T7r) 


(28.17) 


The arrow in (28.17) implies that the asymptotic solution in region 2 that 
appears on the left goes into the asymptotic solution in region 1 that 
appears on the right, but that the converse is not necessarily true. This 
is because a small error in the phase of the cosine introduces the dominant 
increasing exponential in region 2. 1 

Another linear combination of u + and u~ can be found that gives the 
second connection formula 


sin t) 2 A—l cos (£i — ix + rj) (28.18) 


where y is appreciably different from zero or an integer multiple of x. 
The arrow in (28.18) appears since the neglected decreasing exponential 
in region 2 alters the phase of the cosine in region 1 by an indeterminate 
amount if the connection is reversed. 

Energy Levels of a Potential Well. We now give a simple example of 
the application of the WKB approximation that serves as a derivation of 
one of the Bohr-Sommerfeld quantization rules. We wish to find the 
energy levels of a particle moving in the one-dimensional potential well 


1 The converse of (28.17) can be used in the following sense: If some parameter in 
the solution (such as the energy E) is varied continuously so that the phase of the 
cosine in region 1 passes through the value — i r, the increasing exponential in region 2 
disappears for some indeterminate value of the phase close to — \ir and leaves only the 
decreasing exponential. This result is useful, for example, in treating the resonance 
scattering of alpha particles by a heavy nucleus. 
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shown in Fig. 26. For any assumed energy level E, there are supposed 
to be just two turning points of the classical motion such that 

V(z t ) = V(x 2 ) = E 

The regions x < xi and x > x 2 are Type 2 regions in which we know that 
u decreases away from the turning points in order to satisfy the boundary 
conditions at ± 00 . Thus we have only the decreasing exponential WKB 
solution in these regions. 

The connection formula (28.17) can be applied at the turning point 
which separates a Type 2 region from the Type 1 region xi < x < x 2 . 



Flu. 26. Application of tho WKB method to a potential trough; linear turning points 
occur at xi and x». 

The only change is that the lower limit on the £i integral is changed from 
0 to X{, so that the solution to the right of the turning point is 

cos kdx — (28.19) 

apart from an arbitrary multiplying constant. The same connection 
formula can also be applied at x 2 by reversing the direction of the x axis 
and changing the fixed limit on the £ integrals from 0 to x 2 ; the arrow in 
(28.17) still means that we go from a region 2 solution to a region 1 solu¬ 
tion, but now the latter is to the left of the turning point and the former 

is to the right. We redefine £i = r kdx, £ 2 = K(ix so that they still 

increase going away from the turning point, in which case (28.17) can be 
used without any modification. The solution to the left of this turning 

point is then k~ * cos “ kdx — which can be written 

cos ^ kdx — i' 7 r — 77 ^, rj = j*‘ kdx — (28.20) 
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As was the case in the qualitative discussion of discrete energy eigen¬ 
values in Sec. 8, we obtain the energy levels of this system by requiring 
that the two solutions (28.19) and (28.20) join together smoothly in the 
interior of region 1. This evidently requires that v\ be zero or a positive 

integer multiple of i r, since / kdx is necessarily positive. We can write 

Jxx 

the determining equation for the eigenvalues as 

f Xi kdx = (n + iX n = 0, 1, 2, . . . (28.21) 

Jxx 

Equation (28.21) is to be used for values of n up to the point at which E 
becomes so large that one or both of the turning points disappears. 

A Quantization Rule. The expression (28.5) for k can be substituted 
into (28.21) to give one of the Bohr-Somerfeld quantization rules of the 
old quantum theory 

2 [** {2 P [E - V{x)]}*dx = {n + i)h (28.22) 

JXx 

The left side of (28.22) is the integral around a complete cycle of the 
motion (from Xi to # 2 and back to Xi) of the momentum [2 ji(E — F)]b 
The right side is the quantum value of the phase integral, with half¬ 
integer rather than integer quantum numbers. 

It is easily seen from the form of the solution (28.20) that n is the 
number of nodes of the WKB wave function between the turning points. 
Since it is basic to the WKB method that we can develop asymptotic 
solutions like (28.7) only several wave lengths from each turning point, 
the approximation should be good only if the turning points are several 
wave lengths apart, or if n is large in comparison with unity. This con¬ 
firms the earlier view that the WKB method is a semiclassical approxima¬ 
tion, since it is expected to be most useful in the nearly classical limit of 
large quantum numbers. 

Actually, the WKB approximation also gives quite good results for 
the low quantum states of many systems. For example, if we apply 
(28.22) to the harmonic oscillator V(x) = }Kx 2 , it is known from the old 
quantum theory that the correct energy levels are obtained for all quan¬ 
tum numbers. 

Special Boundary Conditions. The boundary condition to be applied 
to a WKB solution at a perfectly rigid wall (V changes discontinuously 
to + oo at x = xq) is that the wave function vanishes there. Thus if k 
(for a region of Type 1) is slowly varying up to x 0 and other turning points 
are remote, the asymptotic solution can be used and has the form 

Zr"*sin (^J x kdx^ • 
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Similarly, for a finite potential step that is far from other turning 
points, the asymptotic WKB solutions can be used up to the point of 
discontinuity of V if k or k is slowly varying. Then the magnitudes and 
slopes of the solutions on the two sides can be matched at this point. 

As pointed out after Eq. (28.5), the WKB method can be applied 
to the radial wave equation for a spherically symmetric potential. When 
l = 0, the radial wave function must be finite at r = 0, and so u must 
vanish there. If k or k is slowly varying there, the asymptotic solutions 
can be used; for example, if E — V(r) is positive, finite, and slowly vary¬ 
ing at and near r — 0, the solution is k~ * sin kdr^. When the 

effective potential energy is infinite at r = 0, either because V itself is 
infinite or because of the centrifugal-force contribution for l ^ 0, the 
situation is more complicated and requires further investigation. 

Problems 

1. A harmonic oscillator of charge e is perturbed by an electric field of strength E 
in the positive x direction. Calculate the change in each energy level to second order 
in the perturbation. Show that this problem can be solved exactly, and compare the 
result with the perturbation approximation. 

2. A harmonic oscillator is perturbed by an extra potential energy fra; 3 . Calculate 
the change in each energy level to second order in the perturbation. 

3. Find the first-order Stark effect for a hydrogen atom in the state n — 3. 

4. A system that has three unperturbed states can he represented by the per¬ 
turbed Hamiltonian matrix 

IE, 0 a \ 

(0 Ei b ) 

\a b K*} 

where E 2 > E\. Use the second-order nondegeneratc perturbation theory to find the 
perturbed eigenvalues. Then diagonalize the matrix to find the exact eigenvalues. 
Finally, use tins second-order degenerate perturbation theory, (loinpure the three 
results obtained. 

5. Show that the total scattering cross section by a potential that falls off at great 
distances like r~ n is finite if n > 2, (a) by means of the Born approximation formula 
(20.20), and (b) by means of the Born approximation expression for the phase shifts 
(26.27) (see footnote 1, page 78). 

_r 

6. Find the differential scattering cross section for a potential V(r) = — F »e a , 
using the Born approximation. What is the validity criterion in this case, and under 
what circumstances is it satisfied? 

7. In a particular scattering problem in which the potential is spherically sym¬ 
metric, the phase shift 5« is large and can he computed exactly, but all the other phase 
shifts are small. Derive an expression for the differential scattering cross section with 
the help of the Born approximation, in which all the phase shifts are taken into 
account,. 

8. Use the Born approximation to dismiss qualitatively the scattering by a crystal 
lattice of identical atoms. 
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9. A trial function ^ differs from an eigenfunction ue by a small amount, so that 
^ - ue + where ue and \pi are normalized and e 1. Show that ( H ) differs 
from E only by terms of order e 2 . 

10. If the first n — 1 eigenfunctions of a particular Hamiltonian are known, write 
down a formal expression for a variation-method trial function that could be used to 
get an upper limit on the nih. energy level. 

11. Find the next terms (of order R~ A ) in the expansion of Eq. (27.12). Show 
that their diagonal matrix element for the unperturbed ground state vanishes, so that 
there is no inverse fourth power contribution to the van der Waals interaction. 

12. Use the combination of perturbation and variation methods employed in Sec. 
27 in connection with the van der Waals interaction to obtain limits on the electric 
susceptibility of a hydrogen atom in its ground state. The electric susceptibility is 
the ratio of the induced electric-dipole moment to the applied electric field, or is the 
negative of the second derivative of the perturbed energy with respect to the electric 
field at zero applied field. 

13. A particle of mass m is bound by the potential of Prob. 6, where ft?/mV oa 2 = £. 
Use the variation method with a trial function e~ ar to get a good limit on the lowest 
energy eigenvalue. 

14. Show that the WKB approximation gives the correct energy eigenvalues for all 
states of the harmonic oscillator. 

15. Apply the WKB method to the one-dimensional motion of a particle of mass m 
in a potential that equals — V o at x =0, changes linearly with x until it vanishes at 
x — ± a, and is zero for \x\ > a. Find all the bound energy levels obtained in this 
approximation if mV oa 2 / ft? — 40. 

16. Discuss the connection between the WKB approximation and the penetration 
through “opaque” potential barriers; the barriers are to be like that considered in 
Sec. 17, although not necessarily square. 
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APPROXIMATION METHODS FOR TIME-DEPENDENT 

PROBLEMS 

It is generally impossible to obtain exact solutions of the Schrodinger 
equation when the Hamiltonian depends on the time. The three approxi¬ 
mation methods that we consider in this chapter all start from the 
assumption that there is a time-independent Hamiltonian that approxi¬ 
mates the actual Hamiltonian in some sense, for which the Schrodinger 
equation can be solved. The time-dependent part of the actual Hamil¬ 
tonian may be small compared to the stationary part, in which case a 
perturbation method can be used. Or there may be time-dependent 
parameters in the actual Hamiltonian that change very slowly ( adiabatic 
approximation) or very rapidly ( sudden approximation) in comparison 
with the periods of the approximate stationary solutions. 

29. TIME-DEPENDENT PERTURBATION THEORY 

The perturbation theory of a system for which the Hamiltonian 
depends on the time 1 is sometimes called the method of variation of con¬ 
stants. It starts from the assumption of Sec. 25 that 

H = Ho + IiH 0 u n = E n u n (29.1) 

where the unperturbed Hamiltonian II 0 can be solved for its normalized 
eigenfunctions u n and its energy eigenvalues E n , and the perturbation H' 
is small. Since IV now depends on the time, stationary solutions of the 
actual Schrodinger equation do not exist, and we must work with the 
time-dependent equation 

ih = H4, (29.2) 

Expansion in Unperturbed Eigenfunctions. Our procedure is to 

xKnt. 

express $ as an expansion in the eigenfunctions u n c. h of the unperturbed 
time-dependent wave equation, where the expansion coefficients evi¬ 
dently depend on the time. 

iEnt 

$ = §a n (t)u n e k 

1 P. A. M. Dirac, Proc . Roy. Soc ., A112, CGI (1926); A114,243 (1927). 
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$ denotes a summation over the discrete set together with an integration 
over the continuous set of eigenfunctions. Substitution of (29.3) into 
(29.2) gives 

iEnt iEnt iEnt 

^ifid n 'Un& ^ “H $a n E n u n e ^ ^ 

where the dot denotes differentiation with respect to the time. 

We replace H 0 u n by E n u n on the right side, multiply through on the 
left by Uk, and integrate over all space, making use of the orthonormality 
of the vt s 

iElet iEnt . 

ihd k e h = S a n e h j u k H : u n dr 

The integral on the right is the matrix element H' kn of the perturbation. 
We define the Bohr (angular) frequency 

__ E k E n 

Ukn — 

and obtain 

d k = (ih)- 1 §H' kn a n e iuknt 

The group of Eqs. (29.5) for all Fs is exactly equivalent to the Schro- 
dinger equation (29.2); the amplitude a n of a particular unperturbed 
eigenfunction u n in the expansion of \f/ has replaced the amplitude ^ at a 
particular point in space. Because of the choice of the representation, 
which is determined by the eigenfunctions of the unperturbed Hamilton¬ 
ian, H o does not appear explicitly in (29.5). 

The perturbation approximation consists in replacing H' by \H f in 
(29.1) and (29.5), and expressing the a’s as power series in X: 

an = a™ + Aa|» + X 2 < 2) + • • • (29.6) 

As in Sec. 25, we assume that these series are analytic for X between 0 
and 1. We can therefore substitute (29.6) into (29.5), equate coefficients 
of equal powers of X, and set X = 1 in the final results. The substitution 
yields the set of equations 

4'” = 0; 4“ +1) = (^)- 1 S^LaSf ) e“‘" i , s = 0, 1, 2, . . . (29.7) 

These can in principle be integrated successively to obtain approximate 
solutions to any desired order in the perturbation. 

First-order Perturbation. The first of Eqs. (29.7) shows that the 
zero-order coefficients a[ 0) are constant in time. Their values are the 
initial conditions of the problem, which specify the state of the system, 
before the perturbation is applied. We assume throughout this section 


( 29 . 4 ) 

(29.5) 
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that all except one of the a^ are zero, so that the system is in a definite 
unperturbed energy state when the perturbation is applied. 1 The results 
that we shall obtain can easily be generalized to situations in which more 
than one of the zero-order coefficients is different from zero. 

We thus put = 8 km or 5 (k — m), according as the state m is one 
of a discrete or a continuous set. Integration of the first-order equation 
gives 

= (iK)~' jl m H , km (t')e iut "‘ t 'dt' (29.8) 

where the constant of integration is taken to be zero in order that a& 
be zero at t = — oo (before the perturbation is applied). If H f is of 
finite duration, the amplitude of a state u k (k ^ m) after the perturbation 
has disappeared is proportional to the time Fourier component of the 
matrix element of the perturbation between this state and the initial state, 
that corresponds to the angular frequency ow given in (29.4). This 
result is analogous to that obtained for the scattered amplitude in the 
Born approximation [see the discussion of Eq. (26.18)]. 

Equation (29.8) takes a particularly simple form if the perturbation 
H' is independent of the time except for being turned on at one time and 
off at a later time. We call these two times 0 and t, respectively, and 
obtain for the first-order amplitudes at the time t (these are also the 
amplitudes at any subsequent time) 

77' pi o )hint __ 1 

«?'(0 = - y“ —-- (29.9) 

7 /- 0 )km 

Thus the probability of finding the system in the state k at t is 

|<(/.)| S = ^ kmt 

The factor sin- ioo k J./wl 1n is plotted in Fig. 27 as a function of aw 

Physical Interpretation. The height of the main peak in Fig. 27 
increases in proportion to 7/ 2 , while its breadth decreases inversely as t , so 
that the area under the curve is proportional to l. Thus if there is a 
group of states k that have energies nearly equal to that of the initial 
state m, and for which ///,,„ is roughly independent of /r, the probability of 
finding the system in one or another of these states is proportional to t. 
This is the physically interesting situation, since what we wish to calculate 
eventually is a transition probability per un.il. time w, and this implies that 

1 This need not conflict, with the uncertainty relation (3.3), wince the infinite lapse 
of time prior to the application of the perturbation makes it possible to determine the 
original energy of the .system with arbitrarily great precision. 
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the probability that a transition has taken place when the perturbation 
has been on for a time t is proportional to t. 1 

It follows that a definite value of w exists only when the final state k 
is one of a continuous or nearly continuous set of states. The spread in 
energy of the final states to which transitions occur, shown in Fig. 27 



Fig. 27. The ordinate is proportional to the first-order perturbation probability of finding 
a system in a state that has an energy different from that of the initial state by hojkm .; the 
scales for ordinate and abscissa depend on the duration t of the perturbation in the manner 
indicated. 

(E k = E m + hcokm)t is connected with the uncertainty relation (3.3) for 
energy and time in the following way. We can regard the perturbation 
H' as a device that measures the energy of the system (which is not 
necessarily its initial energy since the system is disturbed) by transferring 
it to one of the states k. The time available for the measurement is t y 
so that the uncertainty in energy predicted by (3.3) is of order h/t } in 
agreement with the breadth of the main peak in Fig. 27. It is interesting 

1 We assume that the total transition probability to all states k is small enough in 
comparison with unity so that the initial state m is not significantly depleted. This is 
equivalent to the original assumption that the perturbation is small, which means 
that for times t of physical interest, there i little change in the initial state. There 
can still be an effect of observable magnitude if a large number of independent system^ 
receive identical treatment. 
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to note that conservation of energy, suitably modified by the uncertainty 
principle, is an automatic consequence of the calculation and does not 
have to be inserted as a separate assumption. 

Transition Probability. In order to obtain an explicit expression 
for iv, it is convenient to assume that the system is contained in a large 
cubical box of dimensions L that has periodic boundary conditions at its 
walls (Sec. 10). Then the eigenfunctions u n form a discrete set and can 
be normalized to unity in the volume IP. We now consider a particular 
group of final states k that have nearly the same energy as the initial 
state m and for which the matrix element Il' km of the perturbation is a 
slowly varying function of k. We define a density of final states p(k) 
such that p{jk)dE k is the number of such states in the energy range dE kf 
and assume that p(k) is also a slowly varying function of k. 

The transition probability per unit time to one or another of this group 
of states can then be written 


w = r 1 £ |a?»(0l* = r* f \ap(t)\*p(k)dE h (29.10) 

k 


when the box L is large enough so that the summation over k can be 
replaced by the integration over E k . Since H', cm and p(A*) are slowly vary¬ 
ing and most of the contribution to the integral comes from a narrow 
range of energy about E k = E mj they can be taken outside of the integral, 
and (29.10) can be rewritten as 


iv 


1 4 


\HL 

' A 


P(fr) 


(- 


sin 2 iukmt 


^km 


do)k 


(29.11) 


where the index k now refers to a typical one of the group of states having 
about the energy E m . The integral in (29.1.1) is J xr 2 sin 2 xdx — fart, 
so that we finally obtain 

w = 2r h p(A')|//;,J 2 (29.12) 


which is independent of /, as exp(H*.t.ed. 

There may b(^ several different groups of final states k, all of which 
have about the energy E tn but for which the perturbation matrix ele¬ 
ments H' km and the densities of states p(/r), while nearly constant within a 
group, differ from one group to another. Then (29.12) gives the transi¬ 
tions per unit time to a particular group; similar expressions of the same 
form give the rates of transition to other groups. 

Scattering Cross Section. As a first application of Kq. (29.12), we 
calculate w when the initial and final states are frec-particle momentum 
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eigenfunctions (plane waves) and the perturbation is a potential energy 
V(r). The result can be interpreted in terms of an elastic scattering cross 
section, and then agrees with the Born approximation result (Sec. 26), as 
expected. We take for the initial and final states 

u m ( r) = lr l exp iko • r, u k (t) = L~% exp Ik • r 

where k 0 and k are the initial and final propagation vectors, respectively. 
Thus the perturbation matrix element is 

= L~ 3 / exp ( — Ik • r)F(r) exp (fck 0 • r )dr 

= L~ 3 JT(r) exp (IK • r )dr (29.13) 

where K = k 0 — k. 

The density of final states can be found from the permitted values of k 
in a box: k z = 2 \irn z /L } etc., where the n’s are positive or negative integers 
or zero. Thus there are ( L/2tt ) Z dk^dk y dk z states in the range dk x dk y dk z of 
propagation vector. Now there are many different final states k with the 
same energy, corresponding to different directions of k with a given 
magnitude. The matrix element (29.13) usually depends on the direction 
of k, so that we have to consider only a small range of directions at a time. 
We therefore ask for the rate of transition into an infinitesimal element of 
solid angle sin dddd<t> about some direction that is specified by the polar 
angles 0,<£. Then p(k)dE k is equal to the number of states in the range 
dr k given by the above solid angle element and the magnitude element dk 
that corresponds to the energy element dE h . 

kHk sin 6ddd<j> 

Since E k = h 2 k 2 /2p, dE k /dk — h 2 k/p, and we obtain for p(k) 

P(k) = g — 3 ^ - 2 k sin 0d6d(j> (29.14) 

The value of w obtained in this way is the number of particles scat¬ 
tered into the element of solid angle per unit time when there is one 
incident particle in the volume L 3 . This is an incident flux of v/L 3 
particles per unit area and time, where v = hk/p is the speed of the inci¬ 
dent or scattered particle (since energy is conserved). Since the dif¬ 
ferential scattering cross section is defined as the scattering per unit 
incident flux, we have that 

llL 3 

cr(0,4>) sin 6d6d(j> = w 
ftk 


o(k)dE k = 




(29.15) 
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r(d,4>) = ( 2 ^ || F(r) exp (tK • r )di 
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This agrees with the Born approximation result (26.18) and (26.19), 
and has the same range of validity. 

Harmonic Perturbation. Another situation for which Eq. (29.8) 
assumes a simple form occurs when the perturbation depends harmoni¬ 
cally on the time, except for being turned on at zero time and off at time t. 
If we put H km (t') = HZ, sin o>t f , the first-order amplitudes at time t are 


am = - 


2 ih L + u 


gifakm—lt))t - ^ 

Oilcm 0) 


(29.17) 


The probability of finding the system in the state k is appreciable only 
when the denominator of one or the other of the two terms in (29.17) is 
practically zero. Thus there is no interference between the two terms, 
and the perturbation can produce transitions for which <a km = ±co if the 
corresponding matrix element does not vanish. The energy-conserva¬ 
tion condition E k S= E m obtained earlier is now replaced by the condition 


E k = E m ± hcc 


(29.18) 


Equation (29.18) suggests that the first-order effect of a perturbation 
that varies harmonically in the time with angular frequency oi is to trans¬ 
fer to or receive from the system on which it acts an amount of energy 
ha j. This concept will be used for a qualitative treatment of radiation 
processes in Chap. X. 

Second-order Perturbation. The scries of equations (29.7) can read¬ 
ily be solved to second order for a perturbation that is constant in time. 
We take the equation with 8=1, and substitute from (29.9) on the right 
side. 




y_ C U kn ^ nln - (jgiukmt 
hr aj nin 




Integration of this equation subject to the initial condition <4 S, (0) = 0 
gives for the second-order amplitudes at time t 


_ //' //' f> 

«!?>(/) = h--S kn nm - 

0)nm 


km I - 

0) km 


^ilOknt - ] 

Oik -n 


(29.19) 


Equation (29.19) indicates that transitions for which the probability 
increases linearly with the time can occur either lor w*.,,, = 0 or &>&» — 0- 
While the first typo of transition conserves energy between the initial 
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state m and the final state k , the second need not. It is not difficult to 
see that the second bracket term arises from the 1 in the numerator of 
(29.9), which in turn comes from the initial condition at zero time. This 
initial condition means that the perturbation is turned on suddenly; thus 
the mathematical formulation suggests that the second-order transitions 
that do not conserve energy are caused by the sudden turning on of the 
perturbation. This is in agreement with Eqs. (29.8) and (29.17), which 
show that a perturbation that has nonzero frequency Fourier components 
can give up energy to or absorb energy from the system that it perturbs. 
In the case we are now considering, these Fourier components are not 
marked enough to produce in first order a transition probability that is 
proportional to the time, but they do in second order. 

In most practical problems, the sudden turning on of the perturbation 
is introduced as a mathematical artifice that simplifies the calculation. 
Actually, in such cases, the perturbation either is always present, or is 
turned on very slowly, and we are concerned with transitions that con¬ 
serve energy between initial and final states. Problems that can be 
treated by the sudden approximation (see end of Sec. 31) are an excep¬ 
tion; there energy need not be conserved. Throughout this section and 
the next, we assume that only transitions that conserve the energy actu¬ 
ally occur (a ) km = 0), 

Suppose now that the perturbation produces no transitions in first 
order; this means that there are no states n that conserve energy (co nm ^ 0) 
for which the matrix element H' nm ^ 0. Since o) h m = 0, this means also 
that H f nm = 0 whenever = 0. In this case, the second term in the 
bracket of (29.19) is never appreciable. The calculation of w is carried 
through as before, except that a replaces a£ J ; thus (29.12) can be used 
if the matrix element H^ n is replaced by the second-order matrix element 

S eI'-e* (29>20) 

Effect of First-order Transitions. In the event that transitions can 
occur in first order, but they are not to the state in which we are inter¬ 
ested, we can proceed as follows. It is still true that the second term in 
the bracket of (29.19) is negligible for states n that have energies appreci¬ 
ably different from E k (or E m ), since then u kn is large. However, there 
may now be states n for which E n , E m , and E h are all close together and 
neither H r hn nor H f nm is zero. The second bracket term cannot be ignored, 
for without it the summation or integration over n would have a singu¬ 
larity when o) nm is zero. It is not difficult to see that for any value of 
oikm (zero or otherwise), the entire bracket is proportional to o > n « (which 
is equal to oikm — u kn ) when co n m is small; this cancels out the u) nm 
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in the denominator, and makes the summand or integrand finite 
where w n m = 0. 1 

We now show how an explicit evaluation of (29.19) is obtained in this 
case if S can he represented by an integral over E n or co nTn . We divide the 
integral into parts according as |a> nw | is large or is not large in comparison 
with l/t. In the first region, the second bracket term in (29.19) can 
be neglected, since |ojA, n | = |u;tm — u nm \ is also large in comparison with 
l/t (o) fcm = 0 means that takJ, is not large in comparison with unity). 
We thus obtain for this part of the integral 

— - 1 f p(n)hdo> nm (29.21) 

Ukm J Unm 

Here p(?i)dE n is the number of states of the particular group n under 
consideration in the energy range dE n \ the prime on the integral implies 
that the region —c/t ^ co nm ^ c/t is excluded from the integration, where 
c is a constant number that is large in compaiison with unity. If there 
are two or more distinct groups of states n for which the matrix elements 
or densities of states differ, a further summation over these different 
groups must eventually be earned out. 

In the second region, where |oj Tim | ^ c/t, we assume that t is large 
enough so that H f hn H' nm p(n) can be regarded as a constant, taken outside 
of the integral, and evaluated at co nM * = 0. Wc must now use both terms 
in the bracket of (29.19) in order that the integrand bo finite. This part 
of the integral is then 


J ' r c *> km t _ i 

~- 

_ f [ tdfrm 


-A (29.22) 

Oikm _ OJnm 


The integral that appears in (29.22) can be evaluated by considering the 
contour in the complex aj„,„ plane shown in Fig. 28, which contains no 
poles of the integrand. Thus the integral over the closed contour is 
zero, and the integral in (29.22) is equal to the integral around the semi¬ 
circle of radius c/t taken in the counterclockwise direction. The magni¬ 
tude of c o nm is great enough over this semicircle so that the contribution 


1 This result follows quite generally from the structure of the whole perturbation 
calculation, since there is no way in which a singularity cun appear. Thus if the 
perturbation is turned on slowly rather than suddenly, so that the fsnergy-conservation 
difficulties mentioned above do not occur, the second bracket term of (29.19) has a 
more complicated form but still cancels out the singularity at w nm ™ 0. This ean be 
verified by direct calculation. 
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of the second term in the integrand can be neglected in comparison with 
the first. The integral in (29.22) is then easily evaluated and becomes 

flTU. t - 1 

7ri - (29.23) 

Ukm 

For large t, the prime on the integral in (29.21) is equivalent to taking 
its principal value. 1 Thus if we substitute (29.23) into (29.22) and add 
the result to (29.21), we obtain an expression like (29.21) except that the 
primed integral is replaced by the principal value of the integral plus 
iri times the residue of the integrand at the pole u nm = 0. This is equiva¬ 
lent to evaluating the integral along a contour in the complex co nm plane 



Fig. 28. Contour for the evaluation of the integral in Eq. (29.22). 

that passes along the real axis from — a>to °o except for passing beneath 
the origin. We thus obtain finally 

<(,) - ^ I a > in)dE ’ ‘ 29 - 24 > 

where the contour C is over the real axis of E n except for passing under 
the pole of the integrand at E n = E m . Equation (29.24) is to be used in 
place of (29.19) whenever S can be represented by Jp(n)dE n . Compari¬ 
son of Eqs. (29.24) and (29.9) shows that we can use the expression (29.12) 
for w if we replace the matrix element H' km by the integral in (29.24), 
which we sometimes call the second-order matrix element. An example 
of this is given in the next section. 

Intermediate States. We sec that the time-dependent perturbation 
theory gives a result in first order if there is a nonvanishing matrix 

1 E. T. Whittaker and G. N. Watson, “A Course of Modern Analysis,” 4th ed., 
pp. 75, 117 (Cambridge, London, 1935). 
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element of H f that connects the initial state m and the final state Zr. 
If H' km = 0, but there are one or more states n for which neither H' nm nor 
Hfa is zero, the transition occurs in second order. 

It is then convenient to think of one of the states n as an intermediate 
state: the perturbation transfers the system from m to A* in two steps, 
through a state n . Energy need not be conserved for an intermediate 
state, since it has only a transient existence and according to the uncer¬ 
tainty relation (3.3) it is impossible to determine the energy of such a 
short-lived state with any precision. If some of the intermediate states 
do conserve energy, the summation (29.20) over these states must be 
interpreted in accordance with the integral in (29.24). 

In some cases, a perturbation can produce a particular transition only 
through two or more different intermediate states; this corresponds to a 
third or higher order of the perturbation calculation. If the perturba¬ 
tion is small, it usually happens that the result of a calculation to the 
lowest order in which the transition occurs gives a useful result, while 
higher order calculations do not improve on this and may even be quite 
misleading. 


30. INELASTIC COLLISIONS 

The expression for the scattering cross section given in the preceding 
section is easily generalized to a description of inelastic collisions, in which 
internal as well as kinetic energy can be transferred between the colliding 
systems. In this section we apply the result to two problems that are 
typical of first-order and of second-order processes. 1 The latter calcula¬ 
tion is of unusual theoretical interest, for it shows explicitly how a particle 
that is described entirely in terms of a plane wave (momentum eigen¬ 
function) can produce a sharp track in a Wilson cloud chamber. 

Expression for the Cross Section. The expression (29.12) for the 
rate of transition w is applicable to inelastic collisions if the matrix ele¬ 
ment is defined accordingly. We consider here the collision of a fast 
electron with a hydrogen atom in its ground state, and wish to calculate 
the cross section for scattering of the electron through a definite angle 
accompanied by excitation of the hydrogen atom to a definite state. We 
leave out of consideration the possibility that the incident electron 
changes places with the atomic electron; such exchange collisions will be 
taken up in Chap. IX. 


1 The examples considered in this section can also bn treated by an extension of 
the Born approximation; such an extension to first-order rearrangement collisions is 
given in See. 34. For the treatment of second-order processes, it is more convenient 
to work with the method of variation of constants. 
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The unperturbed Hamiltonian is the sum of the kinetic energy of the 
incident electron and the Hamiltonian for the hydrogen atom: 


Ho 


—. v? — — — — 

2m 1 2m 2 r 2 


(30.1) 


where Ti and r 2 are the coordinates of the incident and atomic electrons, 
respectively, referred to the atomic nucleus, which is massive enough so 
that its motion can be neglected. The perturbation is the electrostatic 
energy of interaction between the incident electron and the electron and 
nucleus of the atom 

p2 p2 

W = — - - (30.2) 

r 12 r i 

The unperturbed wave functions are eigenfunctions of (30.1), which 
we choose to be 

Ir a exp (zko ■ rOwioofa) initial state o\ 
I/"" 8 exp (ik • rO^oofe) final state ^ 


In spectroscopic notation, this is a IS —> 2S transition of the atom. The 
magnitude of the propagation vector of the electron after the collision is 
fixed by conservation of energy 


P = kl - 


2m 3e 2 
h 2 8a 0 


(30.4) 


Equations (30.2), (30.3), and (30.4) specify the matrix element that 
appears in (29.12). 

Hii = lr z J J exp (£K • ri^oofra) - “^ioo(r 2 )rfridr 2 , 

K = k 0 - k (30.5) 

The differential cross section can be obtained from w as in the preceding 
section. It must be remembered, however, that k appears in the expres¬ 
sion (29.14) for the density of final states, whereas the initial speed 
y 0 = hko/m appears in the expression for the incident particle flux. Thus 
the cross section is 

(3o - 6) 

where 9 is the angle between the vectors k and k 0 . 

Evaluation of the Matrix Element. It is apparent from the structure 
of the matrix element (30.5) that the term e 2 / r i in the integrand contrib¬ 
utes nothing because of the orthogonality of ^i 00 and u % 00 . This is to be 
expected physically, since interaction between the incident electron and 
the nucleus cannot produce excitation of the atomic electron. 
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The integration of the remaining term over r x can be carried out by 
transforming the volume element from dridr 2 to dr p dr 2) where 9 = — r 2 ; 

the Jacobian of the transformation is easily seen to be unity. We can 
then write 


f SSLg--ftdrt 
J r l2 

where we have taken the polar axis of 9 along the vector K and put w 
for the cosine of the angle between 9 and K. The last integral is not 
strictly convergent, but can be evaluated by inserting an integrating 
factor e~ a P into the integrand and subsequently taking the limit a —> 0 . 
The justification for this is that the integration over r 2 in (30.5), if per¬ 
formed first, gives a result that falls off like \/r\ and hence like 1/p 2 for 
large p , 1 so that for large p, the integrand here behaves like sin Kp/p and 
the integral converges. We thus obtain 

[ eXp — - - (hi = — exp (iK ■ r 2 ) lim f sin Kp<r°>dp 

J ria A „-->() Jo 

= — exp (iK ■ it) lim = 7 ^ exp iK • r 2 (30.7) 


= exp (iK • it) J 

= 2x exp (iK • r 2 ) J J e iKpw pdpd\o 

/ sin Kpdp 
Jo 


= j? exp (iK • r 2 ) 


Substitution of Eq. (30.7) and the expressions following Eq. (10.24) for 
the hydrogen wave functions into (30.5), gives an integral over r 2 that 
can be evaluated as 


it t = j _3 1C \Z2imlv- 
21 (K*ai+$)*’ 


a 0 


J . 2 

me® 


Differential and Total Cross Sections. The differential cross section 
for this collision thus becomes 


where 


a(e) 


k 128a 2 
E (K 2 al +*)• 


(30.8) 


K' 1 = k\ + k' 1 - 2 kje coh 6 

= (2/f 0 Min ^fl) 2 — (A - o — A-)(/r 0 + A- — 2/r 0 coh 9) 


1 The loading term in l/m? when r\ » /■* is 1 /ri, and the integral of this over r« 
vanishes because of the orthogonality of u n,o and u-sou. 
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Now the perturbation calculation is best when koao » 1, in which case k 
is close to ko and we can rewrite (30.4). 


n i\n _i_ 7 \ 2m3e 2 _ 3 
(*o *)(*• + *) h 2 8a# 4a s 


ko + k ~ 2k 0} 


k Q — k = 


8 k 0 al 


(30.9) 


With the help of (30.9), the expression for K 2 in this high-energy 
limit becomes 

K 2 ^ (^tkl — sin 51 id ^ (2 k Q sin id) 2 

Then according to (30.8), most of the scattering occurs for Ka Q <, 1, 
which is equivalent to 6 £ 1/kocto. Beyond this, o*( 0 ) falls off with 
increasing angle approximately like cosec 12 id. This is a much more 
rapid decrease with angle than the cosec 4 id dependence (26.32) obtained 
for elastic scattering by an atom, and is characteristic of inelastic 
processes. 

The total cross section is found by making use of the exact expression 
for K 2 to replace the element of solid angle 2rrsin ddd by 2irKdK/kok i 
with limits ko — k and ko + k. Then the integral of (30.8) can be 
obtained explicitly. However, the discussion of the last paragraph 
shows that at high energies most of the contribution to the integral comes 
from near its lower limit, and in accordance with (30.9) 


(fro “ k) 2 al ^ 


9 

Wfrjag 



We thus obtain the leading term in the total cross section at. high energies 
by taking the limits 0 and °o for K: 


cr =— 


128 t r /2Y° 
5frg W 


(30.10) 


Cross sections for elastic and other inelastic collisions with hydrogen 
may be obtained by replacing m 2 oo in the matrix clement (30.5) by the 
appropriate final-state wave function, and modifying (30.4) accordingly. 
The total elastic cross section at high energies turns out to be 7ir/Skl, 
which is about five times as large as (30.10). Excitation to the states 
that have n = 2 , l = 1 (1 S — > 2 P transitions) is most easily calculated by 
choosing the three final states (m = 0 , ± 1 ) with their polar axis along 
the momentum transfer vector K. Then the factors c*'* that appear in 
the wave functions for m = ± 1 make these matrix elements vanish and 
only the state (210) is excited. This corresponds physically to the 
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inability of the incident electron, whose momentum loss is along K, to 
exert a torque on the atomic electron about this axis. The high-energy 
cross section for this process turns out to be 

<r & (I) ln (4froao) (30.11) 

The appearance of the logarithmic factor in Eq. (30.11) derives from an 
extra factor 1/K 2 in the differential cross section. Thus in comparison 
with the IS —> 2 S scattering, the IS —» 2 P differential scattering is more 
pronounced at small angles and the total scattering decreases less rapidly 
with increasing energy at high energies. 

Production of a Cloud-chamber Track, It seems surprising at first 
that a fast electron, which we can assume possesses a definite momentum 
(magnitude and direction) and hence cannot be localized in space, can 
produce a sharp track in a cloud chamber. This phenomenon may be 
considered from various points of view. In accordance with Ehrenfest’s 
theorem (Sec. 7), we can represent the electron by a wave packet whose 
center of gravity moves like a classical particle. If the wave length is 
short enough, the packet can be quite small without spreading rapidly, 
and will then interact only with atoms that lie close to the path of its 
center. This implies that the electron is represented by a superposition 
of plane waves and hence has an uncertainty in its momentum that 
enables its position to be sufficiently well defined. 

Another approach consists in describing the electron by a single plane 
wave, and regarding its interaction with the first atom that it excites or 
ionizes as a position measurement that carries with it an uncertainty of 
the order of the atomic, size. Thereafter, the electron is represented by a 
packet, like that described in the last paragraph, which is well localized 
if the first atom is large in comparison with the wave length. 

We consider here in detail a description in which the electron and the 
atoms of the cloud-chamber gas are treated as parts of a single system, so 
that we do not have to regard an atomic interaction as a position deter¬ 
mination that changes (-he structure or the electron’s wave function. 1 
To simplify matters, we assume that there are just two atoms present, in 
their ground states, and that their nuclei are far from each other and are 
fixed in space. We then calculate the cross section for a process in which 
both atoms are excited and the electron is scattered inelastieally. For a 
fast incident electron, the perturbation theory can be used, and the 
process is of second order. The calculation is interesting both because 

1 See also W. Heisenberg, “The Physical Principles of the Quantum Theory,” 
p. 06 (University of Chicago Press, Chicago, 1930). 
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of the answer obtained and because it provides an instructive application 
of the second-order perturbation theory developed in Sec. 29. 

The result of the calculation is that the cross section is very small 
unless the momentum vector of the incident electron is nearly parallel 
to the line that joins the two nuclei, and unless also the incident and final 
electron momenta are nearly parallel. These three directions can have 
an angular spread in radians that is of the order of the ratio of the wave 
length of the electron to the size of the atom. This is analogous to the 
result obtained above for the inelastic collision of a fast electron with a 
hydrogen atom: the angular spread of the scattered electron was found 
to be roughly l/k Q ao. It is also in agreement with the wave-packet 
description of the process, since a localization of the electron by an atomic 
size a in a direction transverse to its motion produces an uncertainty in 
the transverse momentum component of amount h/a and an angular 
spread of order h/ap ^ l/k Q a. 

Formulation of the Problem. The nucleus of the first atom can 
without loss of generality be placed at the origin, and that of the second 
atom at the point R. The two atoms are assumed to be far enough apart 
so that the interaction between them can be neglected. The unperturbed 
Hamiltonian is then the sum of the kinetic energy of the incident electron 
and the unperturbed Hamiltonians of the two atoms. The perturbation 
is the sum of the interaction H[ between the incident electron and the 
first atom, and the interaction H' 2 between the electron and the second 
atom. In the initial state, both atoms are in their ground states w 0 with 
energies €o, and the incident electron has the propagation vector k 0 . In 
the final state, the first atom is in the state u n with energy e n , the second 
is in the state Um with energy e m , and the propagation vector of the elec¬ 
tron is k nm . 

It is apparent that the transition in which we are interested cannot 
occur in first order. It can occur in second order, and there are two 
groups of intermediate states. In the first group, the first atom is in the 
state u n , the second in the state Wo, and the incident electron has some 
propagation vector k n0 . In the second group, the first atom is in the state 
Wo, the second in the state u mj and the propagation vector of the electron is 
called k 0m . Thus the second-order matrix element (29.20) is 


V (H' 2 ) nm , nQ (H[) nQM Y 

E oo E n Q L-j E 0 0 — Eq 


)om,00 

0 m 


p _ o, 4. **8 
£°° - 2eo + 


E, 


0 m 


EnO —■ € n + €o + 

M\ m 


Ml 


nO 


2 m 


€0 + + 


2 m 


(30.12) 
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We evaluate only the first sum in (30.12) explicitly and indicate the 
ch-&>uges that are to be made in the result to obtain the second sum. The 
matrix elements that appear there are 

(«) nm,nO 

= 2) exp (— ik nm • r)ffj(2,r)w 0 (2) exp (zk n o • ^dr^dr 

(iyi)n0,00 ( 30 - 13 ) 

= L” 3 //^n(l) exp (— ik n0 * r / )//i(l J r / )w 0 (l) exp (zk 0 * r')dridT' 


H^re 1 and 2 denote all the internal coordinates of the first and second 
atoms, respectively, and dr x and dr 2 are the corresponding elements of 
itx-begration; r and r' are variables of integration with volume elements 
&r and dr *, both of which specify the position of the incident electron with 
respect to the origin. An integral over 1 has been performed in the first 
of Eqs. (30.13) to give unity, and a similar integral over 2 has been per¬ 
formed in the second equation. 

Evaluation of the k Sum. When the matrix elements (30.13) are 
substituted into the first sum of (30.12) and the sum and integrals are 
interchanged, we must evaluate 



exp ?k n 


*So 



* 2 ■ ** “ On - c 0 ) 


(30.14) 


For a box of sufficiently large dimensions L, we can replace the summa¬ 
tion in (30.14) by an integration: 


A Y f eX P ’ ( r ~ r 0 

;2t) J k* Q - k 2 dTk ”° 


(30.15) 


THe integral in (30.15) has the form of that which appeared in the Green's 
function (20.12) for a free particle. Thus the only new problem pre¬ 
sented by the evaluation of (30.15) is the determination of the contour 
thi-Ebt takes proper account of the singularity of the integrand at k n0 = k. 

This singularity is of the type discussed near the end of the last section 
and arises from the possibility of first-order transitions in which just 
ono of the atoms is excited. Equation (29.24) shows that the proper 
contour to use for the integral over the magnitude k n0 of the vector k n0 is 
ono that goes from 0 to + co by passing under the real axis at the pole*. 
Aft,er the angular integrations are performed, the integrand is even in 
kn.n 9 and this contour can bo reflected in the origin. The resulting con- 
tour, which extends from — oo to is exactly the same as that used 
in bhe evaluation of (29.13) and shown in Fig. 21a. We therefore obtain 
from the earlier work an explicit expression for the summation (30.14): 


‘l7r[r 


— r 


exp v‘#cjr — r'| 


(30.16) 
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Second-order Matrix Element. It is convenient to rewrite the matrix 
elements (30.13) in terms of new functions 

F m (t - R) = /u m (2)H'(2 J r)« 0 (2)dr 2 
F„(r') = K(l)tfi(l,r')«o(l)dn 

These functions are very small except when their arguments differ from 
zero by distances of the order of the size of the atom. We put 

r" = r — R, 

so that practically all the contribution to the first summation of (30.12) 
comes from small values of r' and r". We can then approximate 

p . r n p . r t 

|r - r'| = |R + r" - r'| g*R + ±JL- ±JL 
[r - r'| -1 ^ R- 1 

to obtain the leading terms in (30.16) for large R . The first summation of 
(30.12) then becomes, to this approximation, 

_ 2m __j__ exp z(k -k„ w ) _ -R j n(f , )[exp l(k() _ k) . t > W 

• J F m (t")[exp i (k - k„,„) ■ r "]dr" (30.17) 

where k is a vector of magnitude k given by (30.14), that has the dirotition 

of R. 

In similar fashion, the second summation of (30.12) becomes 
_ 2m _j_ expf(tt' + ko) -R J n(r , )[exp _ i{v! + k .) . r '],/ T ' 

• J F m ( r")[exp t(k 0 + k') • r "]tk" (30.18) 

where k' is a vector in the direction of R whose magnitude is given by 
(30.14) with € n replaced by € m . 

The differential cross section is obtained by substituting the sum of 
(30.17) and (30.18) for II^ in (30.6), and replacing k by k nm . Oon- 
servation of energy requires that 

*8 - ^ («• + e. - 2eu) 


Discussion of the Cross Section. The integrals that appear in (30.17) 
and (30.18) have the characteristic structure associated with the per¬ 
turbation treatment of collision problems. They are very small unless 
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the propagation vector that appears in the exponent of the integrand has 
a magnitude that is of order 1/a or less, where a is a typical linear dimen¬ 
sion of the atom (F significantly different from zero). It follows that 
(30.17) is significant only when the vectors k 0 , k, and kw are nearly equal 
in magnitude and direction. Because of the assumption that the inci¬ 
dent electron is fast, the magnitudes are very nearly equal in any event. 
Then the cross section is appreciable only when the vectors R and k nm are 
nearly parallel to k () . The permitted angular deviation from parallelism 
is easily seen to be of order l/k Q a. 

In similar fashion, it follows that (30.18) is significant only when 
k', and hence R, is nearly antiparallel to both k 0 and k nm , in which case 
the latter two vectors are nearly parallel to each other. 

The two terms together show that excitation of both atoms occurs 
with appreciable probability only when the line joining the two atoms is 
nearly parallel to the direction of the incident electron. It is apparent 
also that the cross section falls off inversely as the square of the distance 
R between the two atoms, as would be expected. 

31. ADIABATIC AND SUDDEN APPROXIMATIONS 

In this section we develop approximation methods that involve the 
rate of change of the Hamiltonian, rather than the magnitude of the time- 
dependent part of the Hamiltonian. If the Hamiltonian changes very 
slowly with the time, we expect to be able to approximate solutions of 
the Schrodinger equation by means of stationary energy eigenfunctions 
of the instantaneous Hamiltonian, so that a particular eigenfunction at 
one time goes over continuously into an eigenfunction at a later time 
(adiabatic approximation). If the Hamiltonian changes from one steady 
form to another over a very short time interval, we expect that the wave 
function does not change much, although the expansion of this function 
in eigenfunctions of the initial and final Hamiltonians may be quite dif¬ 
ferent, (sudden approximation). We determine here to what extent both 
of these typos of approximation are valid. 

Expansion in Instantaneous Energy Eigenfunctions. We consider 
first the adiabatic approximation, and wish to solve the Schrodinger 
equation 

ih % = a m (3i.i) 

al 

when //(/) varies slowly with the time. 1 The solutions of the energy 
eigenvalue equation at each instant of time are assumed to be known. 

mOunO) = KMM) (31.2) 

1 M. Horn and V. Knelt, Zrits. f. ITiyxik, 61, 165 (1U28); P. Ottltinger, Z<uts. f. 
Phyaik , 73, 161) (11)31). 
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We assume also that the u n are orthonormal, nondegenerate, and dis¬ 
crete; their phases are fixed below. 

Suppose that the wave function is known at zero time; at later times 
we put 

4> = ^ a n (t)u n (t) exp [ - \ J* E n (t')dt '] (31.3) 

71 

Substitution of (31.3) into (31.1) gives 
ih (d n u n + «» ~ % a n u„E^j exp | ~ ft f Q E n (if)dt' j 

H/> 

Since Hu n = E n u n from (31.2), the last term on the left side cancels the 
right side. Multiplying through on the left by u h and integrating over 
all the coordinates of the system (Jdr), we obtain 

d h = - ^ a» exp [j J* (E k - E,w\ • J u k ^ dr (31.4) 

n 

We now seek an expression for the integral on the right side of Eq. 
(31.4) that is easier to interpret in physical terms. Differentiation of 
(31.2) with respect to t gives 


= H ^ a n u n 



dH ,, _i_ v dUn 


I 77T du n 

-df Un + En -di 


Multiplying through on the left by u k and integrating over the coordi¬ 
nates gives 

j H-k — u n dr + j u k H ^fdr = Bn J U„ dr, k * n (31.5) 

We make use of Eq. (22.10) to rewrite the second integral on the left 
side of Eq. (31.5), and remember that H is Hermitian. 

/ a * H tt * - / < Ha *> if * - * / ”* if * 

Substitution into (31.5) gives an expression for the integral on the right, 
side of (31.4) 



n ^ k (31.6) 
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Choice of Phases. In order to rewrite Eq. (31.4) along the lines 
indicated in the last paragraph, we must have an expression for 

/^ir* 

We now show that this integral is pure imaginary, and that a proper 
choice of the dependence of the phase of u n on t makes it zero. Differen¬ 
tiation of the normalization integral for u n gives 

0 " Jt { flnUndr = / -af UndT + / itr Ar 

Since the two integrals on the right are complex conjugates of each other, 
each must be pure imaginary: Ju n (du n /dt)dr = ia(t). 

We now change the phase of u n by an amount 7 (/), which is permissible 
since the phases of the eigenfunctions are arbitrary at each instant of 
time. For the new eigenfunction u f n = u n e iy(t) , 

J < ^ dr = J u n ii~ iy ~ (u„e iy )dr = ia(t) + i y(t) (31.7) 

Thus the choice y{t) — — a(t')dt' for the phase makes the integral on 

the left side of (31.7) vanish. In what follows, we assume that u' n has 
been substituted for u n , with a consequent change in Eqs. (31.6), and 
omit the primes. 

We adopt our earlier notation ha> kn = Eh — E n) and substitute (31.6) 
into (31.4): 

n 

where the prime on the summation indicates that, the term n = k is 
excluded from the summation. The last term on the right side of Eq. 
(31.8) is the kn matrix element of dll/dt . 

Adiabatic Approximation. The group of Eqs. (31.8) for all h ’s is 
exactly equivalent to the Schrodinger equation (31.1). We now estimate 
the order of magnitude of a k by assuming that all the quantities 
(a n , cojfcn, Un , dll/dt) that appear on the right side of (31.8) are constant in 
time. If further we assume that the system is in the state m at t = 0, 
we can put a„ = 5 nw . We thus obtain 



k m 
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which is readily integrated to give 


k 7 * m (31.9) 


With the above approximations, Eq. (31.9) shows that the probability 
amplitude for a state other than the initial state oscillates in time and 
has no steady change over long periods of time even though H changes 
by a finite amount. If the change in H during the Bohr period for the 
transition m —> k is small in comparison with the energy difference 
between these two states, the transition is unlikely to occur. The change 
in amplitude of the state k after a long time is of the order of the ratio of 
these two energies. 


\a k \ ~ 


(l/co hm )(dH/dt) 
Ek — E m 


(31.10) 


Connection with Perturbation Theory. An exceptional situation 
arises when the Hamiltonian oscillates in time with a frequency nearly 
equal to one of the transition frequencies, say co km . This is a case of 
resonance, and we expect from the discussion of Sec. 29 that even a very 
small change in H can produce appreciable changes in the amplitude a k 
over long periods of time, so that (31.10) is not valid. It is then no 
longer permissible to assume that the time dependence of dH/dt can be 
neglected, andjdie passage from (31.8) to (31.9) is not justified. 

In order to consider this case more carefully, we assume that only a 
small part of H oscillates in time with an angular frequency co that is 
close to co bm : 

n TT 

H = H q + H r sin cot, — = colV cos col 


where H r is small in comparison with H Q and both of these are constant in 
time. If then the dependence of a n , co knt and u n on time is neglected and 
we put a n = 5nm as before, Eq. (31.8) becomes 


a k ^ 


aH *» cos cot e<aw 
hoikm 

TT t 

* hm e i(, ulm -w)n 

2 hco km L j 


This is readily integrated to give 


MO 


1 kin 


2 ihco t 


km 


V 


(tdfcm-hu)/ _ 


&km 


H“ CO 




(Ohm “ CO 


(31.11) 


This shows that the adiabatic approximation (31.10) breaks down for 
«awi — ±w, since then (31.11) increases steadily with the time. Tf is 
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close to +o>, the first bracket term can be neglected and we can replace 
oj/c oicm outside the bracket by +1; if co*™ is close to — w, the second bracket 
term can be neglected and we can replace w/Wn by — 1. In both cases, 
we see that (31.11) agrees with the perturbation-theory result given in Eq. 
(29.17). 

Discontinuous Change in H. As an introduction to the sudden 
approximation, we consider first a situation in which the Hamiltonian 
changes discontinuously from one form that is constant in time to another. 
Suppose that H — Ho fort < OandH = Hi fori > 0, where 

HoU n = EriUnj H lV m = E m v m 

and the u’ s and v’s are complete orthonormal sets of functions that are 
not necessarily discrete. The general solutions can be written 


- 

yj/ — ^CLn'U'n& ^ 
iE„t 

$ = A 


t < 0 
t > 0 


(31.12) 


where the a’s and f/s are independent of the time. 

Since the wave equation (31.1) is of first order in the time, the wave 
function at all points in space must be a continuous function of the time 
at t = 0, although its time derivative is not. The b ’s are then readily 
expressed in terms of the a’a by equating the two solutions (31.12) at 
t = 0, multiplying by a particular v } and integrating over the coordinates: 

bm = Vtn'llntf-T (31.13) 


The appearance of final states m that need not have the same energy as 
an initial state is a consequence of the noil-zero frequency Fourier com¬ 
ponents into which the suddenly changing Hamiltonian can be resolved 
(see Sec. 29). 

Sudden Approximation. The sudden approximation consists in using 
I0qs.(3l.l3) when the change in the Hamiltonian occupies a very short 
but finite interval of time / 0 . In order to make an estimate of the error 
introduced in b m we consider a problem that, while somewhat artificial, 
can easily be solved formally. Suppose that II = Jf {] for t < 0, II = Hi 
for / > and II = //< for 0 < t < £». The intermediate Hamiltonian, 
which is assumed to be constant in time, has a complete orthonormal set 
of energy eigenfunctions: 

HiW h = EicWk 

The true solution can be expanded in terms of the w 9 s with constant 
coefficients: 


\p = S CkW 


iEkf 

ft 


0 < t < to 
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The continuity condition at t = 0 gives 

Cjo = (31.14) 

In similar fashion, the continuity condition at t = to gives, with the help 
of (31.14), 

_ i(Eh — E m )tn 

b m = S kfikS^WiA'r' • e * 

_ i(Ek — Em)ta 

= S*S" a »/ W m u n drj• e A 

__ j(Eh — Em)to 

= Snanff^S^wne * ]w„dTrfT' (31.16) 

where the primes denote a different set of coordinate variables of integra¬ 
tion. The closure relation (10.11) shows that the bracket in the last 
term of (31.15) becomes a product of 5 functions of the differences 
between primed and unprimed coordinates when to = 0, in which case 
the expression for b m agrees with (31.13), as it should. 

The difference between the exact expression (31.15) for b m and the 
approximate expression (31.13) is measured by the difference between 
exp [— i(E k — E m )to/h] and unity. , This is small if to is small in compari¬ 
son with all the periods h/(E h — E m ) that correspond to the eigenfunc¬ 
tions k and m that appear when H changes. 

A useful validity criterion is that to be small in comparison with the 
periods associated with the initial motion, since new states of motion that 
have very much shorter periods (high energies) are excited with relatively 
small amplitudes. When the sudden approximation is useful, the error 
in b m (and hence in ^) is of the order of the ratio of t {) to a typical initial 
period. 

Transient Disturbance. An interesting special case of (31.15) is that 
in which initial and final Hamiltonians are the same (Hi = H 0) = u m ) 

and the system is initially in a particular state n. Then if to is short 
enough to satisfy the validity criterion of the last paragraph, we can 
expand the exponential in the last member of (31.15) and retain only the 
first two terms. 


b — 


J j UmSkW'kWk 

1 - ^ (E h - E m ) 

J J KSkW'kWk 

1-1 

1 

5 ? 

1 

t-H 


u n (hdr' 


u n drdT f 


With the help of the closure relation, the orthogonality of u m and u n when 
m n, the substitution E^dJU = Hotim, and Eq. (22.10), this can be 
reduced to 
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Hq^UhJCIt j TYh 7 ^ W (31.10) 

Equation (31.16) can be generalized to a situation in which Hi depends 
on the time; in this case a result that is also correct to first order in t 0 

is obtained by replacing Hit 0 by j* H4t. 

It should be noted that the expression (31.16) for b m can be useful 
even when Hi — H 0 is not small in comparison with i7 0 , provided that 
the general criterion for the validity of the sudden approximation is 
satisfied ( t Q sufficiently small). On the other hand, the perturbation 
theory (Sec. 29) is useful when a small time-dependent addition to the 
Hamiltonian is applied for a long time. 

Disturbance of an Oscillator. As a simple example of the application 
of the approximation methods developed in this section, we consider a 
linear harmonic oscillator in which the position of the equilibrium point 
a(t) depends on the time. The Hamiltonian for this system is 

The instantaneous energy eigenfunctions are the harmonic-oscillator wave 
functions (13.13) centered at the point a(£), and the energy levels are 
unchanged: 

u n (x) = N u H n [a(x — a)]c“ *«*<*-»>*, E n = (n + £)Aco 0 

We suppose first that the equilibrium point moves slowly, and 
investigate the circumstances under which the adiabatic approximation is 
applicable. If the oscillator is initially in its ground state (a = 0), the 
time derivative of the Hamiltonian dll/dt = — K(x — a)d has a non¬ 
vanishing matrix clement only with the first excited state. With the 
help of (13.18) this is found to be 

Substitution into (31.9) shows that the coefficient of the time-dependent 
factor in the amplitude of the first excited state has the magnitude 

Kd (jA)* = _ a _ 

hu 2 0 (7im)l 

This expression may be interpreted physically by noting that the denomi¬ 
nator is of the order of the maximum speed of a hypothetical classical 
oscillator that has the zero-point energy. Thus the adiabatic approxima¬ 
tion is good if the equilibrium point moves slowly in comparison with the 
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classical-oscillator speed. It is easily seen that for the nth excited state, 
the equilibrium-point speed must be small in comparison with 1 fn times 
the corresponding classical-oscillator speed. 

The sudden approximation can be applied to an oscillator in its ground 
state when the time required to move the equilibrium point from one 
steady position to another is small in comparison with l/a> c . If this point 
is displaced a distance a in the positive direction along the line of motion, 
application of (31.13) shows that the probability amplitude for the nth 
state after the displacement is 

— [ u n (x — a)er^ a% ^dx = % f u n {x ) * “ 2 C:c+a) *dx 

n J - * tv* J ~ * 

This integral is identical with the expression for A n in Eq. (13.21), except 
for the sign of a, and has already been evaluated with the help of the 
generating function (13.10) for the Hermite polynomials. The earlier 
discussion (Sec. 13) shows that the states most likely to be excited are 
those that have a classical amplitude of oscillation that is of the order of 
the displacement a ; this is in agreement with the corresponding classical 
result. 

Problems 

1. A hydrogen atom in its ground state is placed between the platen of a condenser. 
A voltage pulse is applied to the condenser so as to produce a homogeneous electric 

__ t 

field that 1ms the time dependence: E = 0, t < 0; E = E (1 c t, l > 0. Kind the first- 
order probability that the atom is in the 2 S state (200) after a long time. What 
is the corresponding probability that it is in one of the 2 P stab's? 

2. An alternating voltage of angular Frequency co > nu A /2h 3 is applied to the 
condenser of Prob. 1. What is the probability per unit time for the hydrogen atom 
to make a transition from its normal state to an ionized state? Assume, only for the 
purpose of this problem, that the electronic wave function for the ionized state can he 
represented by a plane wave. 

3. Extend Eq. (20.20) to the case in which a transition can occur only in third 
order of the perturbation. Assume that none of the intermediate states 1ms the same 
energy as the initial and final states. 

4. Use the perturbation theory to calculate the differential collision cross section 
for the 1$ —> 2S excitation of a hydrogen atom. Integrate this to obtain the total 
cross section, and show that it becomes the expression given in Sec. 30 at liigh bom¬ 
barding energy. 

5. Use the perturbation theory to calculate the differential collision (toss section 
for the 1*S —> 2 P excitation of a hydrogen atom. Show that, the total cross section 
becomes the expression given in Eq. (30.11) at high bombarding energy. 

6. Discuss the statement that appears at the end of the next to the last, paragraph 
of Sec. 31. In particular, show physically why it need not he a sufficient condition 
for the applicability of the adiabatic approximation that the equilibrium-point speed 
be small in comparison with the corresponding classical-oscillator speed. 
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7. Under what circumstances is $(t) =* [exp ( — iHl/h)]^(G) a valid representation 
of the solution ^ at time t in terms of the Hamiltonian H and the solution at time 
zero? Show that, in general, the operator Ht in the exponent cannot be replaced by 

jf* Hdt f . Show, however, that HiU in Eq. (31.16) can be replaced by J q ° Hidt to first 

order in to . 

8. A hydrogen nucleus of mass 3 is radioactive, and changes into a helium nucleus 
of mass 3 with the emission of an electron that has about 17,000 electron-volts energy. 
Show that the sudden approximation can be applied to the extranuclear electron that is 
initially present in the hydrogen atom. Calculate the numerical values of the prob¬ 
abilities that the resulting helium ion is found in its 1£, 2 S, and 2 P states if the hydro¬ 
gen atom is initially in its 1& state. 



CHAPTER IX 

IDENTICAL PARTICLES AND SPIN 


The quantum-mechanical theory of particles presented thus far is 
deficient in three respects. First, whenever two or more particles are 
described at once, like the electron and proton of the hydrogen atom 
(Sec. 16) or the incident and atomic electrons in an inelastic collision 
(Sec. 30), it is assumed that the pai'ticles can be distinguished from each 
other. This is .a valid assumption in the first example, since electrons 
and protons possess quite different masses and electrical charges. In 
the second example, however, there is no observable difference between 
the incident and atomic electrons, and the consequences of this identity 
should appear in the formalism. The second defect of the theory is the 
omission of an intrinsic spin angular momentum, or spin , which is actually 
possessed by the particles found in nature. 1 Third, no mention has as 
yet been made of the special theory of relativity, which is expected to 
affect the theoretical description of particles that move with speeds close 
to that of light. 

The ways in which the first two of those defects can be remedied arc 
described in this chapter, and illustrative examples are discussed, 
Relativistic effects are taken up in Chap. XII. 

32. IDENTICAL PARTICLES 

Identical particles cannot be distinguished by means of any inherent 
property, since otherwise they would not be identical in all respects. 
In classical mechanics, the existence of sharply definable trajectories for 
individual particles makes it possible in principle to distinguish between 
particles that are identical except for their paths, since each particle can 
be followed during the course of an experiment. In quantum mechanics, 
the finite size and the spreading of the wave packets that can describe 
individual particles often make it impossible to distinguish between 
identical particles because of their positions, especially if they interact 
with each other to an appreciable extent. This is true of the electrons 
in a single atom, where we have seen that the description in terms of 
moving wave packets breaks down completely. However, the electrons 
of different atoms that are well separated from each other may, to good 

1 Spin was first discovered in connection with electrons, by G. 10. Uhlcnheck and 
S. Goudsmit, Nalurmiss 13, 953 (1925); Nature, 117, 264 (1926). 
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approximation, be regarded as distinguishable. This section considers 
some of the effects of identity on the quantum-mechanical treatment of 
systems of two or more particles. Other effects that involve the spin 
explicitly will be taken up in the remainder of this chapter. 

Physical Meaning of Identity. The impossibility in principle of dis¬ 
tinguishing between identical particles in most quantum-mechanical 
problems can lead to effects that have no classical analogue. As an 
example, we compare the elastic collision of two identical particles that' 
have a particular interaction between them, with the collision of two 
different particles that have the same interaction between them. 

In a classical treatment, there is no difference of principle between the 
results of these two experiments, since it is possible to distinguish between 
the incident and struck particles in the first case as well as in the second. 
In practice, however, this distinction would usually be made only in the 
second experiment. Thus, according to classical mechanics, the meas¬ 
ured differential cross section in the first experiment is equal to the sum 
of the corresponding cross sections measured for the incident and struck 
particles in the second experiment. In the corresponding quantum- 
mechanical situation, the identical particles in the first experiment cannot 
be distinguished by means of their trajectories, since they cannot be well 
localized without interfering with the scattering process. Thus the dis¬ 
tinction between incident and struck particles has no physical significance, 
and the simple connection between the results of the two experiments that 
is found in the classical case need not exist. 

We use the word identical to describe particles that can be sub¬ 
stituted for each other under the most general possible circumstances 
with no change in the physical situation. Identical particles can in some 
cases be distinguished from each other, as when their wave packets do 
not overlap. Another case, discussed more fully in Sec. 33, arises when 
each of the particles possesses an intrinsic spin angular momentum, 
which is a constant of the motion in a particular collision. Then since 
the component of the spin along some axis is assumed not to change dur¬ 
ing this collision, the particles can be distinguished if they have different 
spin components. Results of this kind must, of course, be a consequence 
of the formalism that we now set up. 

Symmetric and Antisymmetric Wave Functions. The Schrodinger 
wave equation for n identical particles is 

(1,2, . . . ,n; t ) = 7/(1,2, . . . ,»)*( 1,2, t) (32.1) 

where each of the numbers represents all the coordinates (positional and 
spin) of one of the particles. The Hamiltonian IJ is symmetrical in its 
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arguments, since the identity of the particles means that they can he 
substituted for each other without changing H. 

There are two kinds of solutions 4 / of Eq. (32.1) that possess symmetiy 
properties of particular interest. A wave function is symmetric if the 
interchange of any pair of particles among its arguments leaves the wave 
function unchanged. A wave function is antisymmetric if the interchange 
of any pair of particles changes the sign of \f/. We now show that the 
symmetry character of a wave function does not change in time. If 
\ps is symmetric at a particular time t , then H\f/ S is also symmetric, and 
(32.1) states that d$n/dt is symmetric. Since \ps and its time derivative 
are symmetric at time t , fa at an infinitesimally later time t + dt is given 
by i/'s + ( d\ps/dt)dt , and is also symmetric. Such a step-by-step integra¬ 
tion of the wave equation can, in principle, be continued for arbitrarily 
large time intervals, and \ps is seen to remain symmetric always. In 
similar fashion, if fa is antisymmetric at any time, H\pA and hence 
dyp A /dt are antisymmetric, and integration of the wave equation shows 
that }pA is always antisymmetric. 

The foregoing proof is not altered if H and \p have as their arguments 
the coordinates of two or more different groups of identical particles; 
thus a wave function that is initially set up to be symmetric or anti¬ 
symmetric in the coordinates of each identical-particle group always 
retains this character. This makes it possible for the different groups of 
identical particles found in nature to have definite symmetry characters, 
and this is actually found to be the case. Electrons, protons, and neu¬ 
trons are the only material particles for which the experimental evidence 
is unambiguous, and each of these kinds of particles is described by anti¬ 
symmetric wave functions. 

Construction from Unsymmetrized Functions. We now show how 
fa or }p A can be constructed from a general unsymmelrized solution \f/ 
of Eq. (32.1). If the arguments of 4 / are permuted in any way, the result¬ 
ing function is a solution of (32.1). That this is true follows from the 
observation that the same permutation applied throughout Eq. (32.1) 
does not impair its validity, since it corresponds simply to a relabeling 
of the particles; then since II is symmetric,, the permuted II is the same 
as the original H, and the resulting equation is the same as (32.1) for 
the permuted 4/. In this way n! solutions can be obtained from any one 
solution, each of which corresponds to one of the a! permutat ions of the n 
arguments of It is evident that any linear combination of these func¬ 
tions is also a solution of the wave equation (32.1). 

The sum of all these functions that are linearly independent is a sym¬ 
metric (unnormalized) wave function fa, since the interchange of any 
pair of particles changes any one of the component fund ions into another 



Sec. 32] 


IDENTICAL PARTICLES AND SPIN 


219 


of them and the latter into the former, leaving the entire wave function 
unchanged. An antisymmetric unnormalized wave function can be con¬ 
structed by adding together all the permuted functions that arise from 
the original solution by means of an even number of interchanges of pairs 
of particles, and subtracting the sum of all the permuted functions that 
arise by means of an odd number of interchanges of pairs of particles in 
the original solution. It is apparent that a nonvanishing antisymmetric 
wave function cannot be formed from a solution that is unaltered by the 
interchange of any pair of particles. 

In the event that the Hamiltonian does not involve the time, station- 

_ \Et 

ary solutions . . . ,n; t) = u( 1,2, . . . ;n)e h can be found, 

where 

[H{ 1,2, . . . ,n) - E]u{ 1,2, . ■ . ,n) = 0 

The earlier discussion shows that the solutions derived from any u by 
means of permutations of its arguments are degenerate with the original 
u) this is called exchange degeneracy. When n = 2, the 21 = 2 permuta¬ 
tions result in w(l,2) and w(2,l); the symmetric and antisymmetric com¬ 
binations are obtained by taking the upper and lower sign, respectively, in 

m(1,2) ± u( 2,1) (32.2) 

When n = 3, the 3! = 0 permutations yield 7/(1,2,3), u( 2,1,3), u( 3,2,1), 
w(l,3,2), u(2,3,l), and u( 3,1,2); the symmetric and antisymmetric com¬ 
binations are 

Kl,2,3) + u(2,3,l) +u(3,l,2)] 

± [77(2,1,3) + 7/(1,3,2) + 77(3,2,1)] (32.3) 

with the upper and lower sign, respectively. 

All the energy eigenfunctions that are exchange degenerate with 
m( 1,2) can be formed from the two solutions (32.2). When n = 3, 
however, there are four linearly independent, eigenfunctions that cannot be 
formed from the two functions (32.3). These additional solutions, which 
always appear when n > 2, can be chosen so as to possess permanent 
symmetry characters that resemble but are somewhat more complicated 
than those of the symmetric and antisymmetric solutions. However, 
they do not appear to describe particles found in nature. 

Distinguishability of Identical Particles. It is to be expected that 
the result of an experiment is independent of the symmetry character of 
the wave function if the coordinates of the particles do not overlap. This 
corresponds to a situation in which the particles can be distinguished by 
means of their positions (or their spin components) even though they are 
identical. Such a situation implies, in the case of two particles, that the 
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wave function u(l,2) is different from zero only when the coordinate 1 iB 
in some region A, the coordinate 2 is in a region B, and A and B have no 
common domain. 

The coordinate probability density associated with the wave function 
u( 1,2) is |u(l,2)| 2 , and the densities associated with the symmetrized 
wave functions (32.3) are 

|w(l,2) ± u(2,l)| 2 = |w(l,2)| 2 + !m(2,1)| 2 ± 2Re[u(l,2)u(2,l)] (32.4) 

where Re denotes the real part of the expression in brackets. If now 
m( 1,2) vanishes whenever 1 is not in A and 2 is not in B, and A and B do 
not overlap, the bracket term is zero everywhere, and (32.4) becomes 
|t*(l,2)|* + K2,l)| 2 . 

Thus the density associated with either of the symmetrized wave 
functions (32.2) is the sum of the densities associated with u(l,2) and 
u(2,l) separately. This is precisely the result that will be obtained if 
the particles are not identical but no attempt is made to distinguish 
between them in performing the experiment. Thus the interference 
effects between exchange-degenerate wave functions, represented by the 
bracket term in (32.4), disappear when the coordinates of the particles 
do not overlap. 

The Exclusion Principle. In many problems, a useful zero-order 
approximation can be obtained by neglecting the interactions between the 
particles that make up the system under consideration. The approxi¬ 
mate (unperturbed) Hamiltonian is the sum of equal Hamiltonian func¬ 
tions for the separate particles 

H„(l,2, • • • ,n) = HS(1) + H$(2) + • ■ • + H'„(n) (32.5) 

and the approximate energy eigenfunction is a product of one-particle 
eigenfunctions of HJ 

w(l,2, . . . ,n) = a a (l)v@(2) . . . »>„(«,) 

E = E a + + • • • + K, (32.6) 

HJ(l)i>.(1) = E a v a ( 1), etc. 


If the particles are electrons, an antisymmetric wave function must be 
constructed from the u given by (32.6). This is most easily expressed 
as a determinant of the r’s: 


wa( 1,2, ...«) = 


y«(i) 

v a (2) 

■ ■ ■ !’„(») 

ffi(t) 

ve(2) 

• • • t’n(n) 

a„(l) 

v,(2) 

■ ■ ■ »;„(«) 


(32.7) 


The (unnormalized) Ua. given in (32.7) is clearly an antisymmetric solu¬ 
tion of the approximate wave equation (Ho — E)u A = 0. 
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Equation (32.7) has the interesting property that it vanishes if two 
or more of the v’s are the same. This is a special case of the general 
result stated earlier that an antisymmetric wave function cannot be 
constructed from a solution that is unaltered by the interchange of any 
pair of particles. Thus the approximate Hamiltonian H 0 has no solu¬ 
tions for which there is more than one electron in any one of the states 
a, ft, . . . , v. This result is known as the exclusion principle and was 
first postulated by Pauli 1 as an explanation of the periodic system of the 
chemical elements (see Sec. 38). 

Connection with Statistical Mechanics. The unsymmetrized zero- 
order solution given in Eq. (32.6) can be used to construct a symmetric 
as well as an antisymmetric wave function. Such a symmetric (unnor¬ 
malized) function is easily seen to be the sum of all different permutations 
of the numbers 1,2, . . . 3 n among the one-particle eigenfunctions 
v a , Vfa , v v . This wave function is unique, and can be specified 

simply by stating how many particles are in each of the states a, /S, . . . . 
In the same way, an antisymmetric wave function can be specified by 
stating how many particles are in each state. The fundamental statis¬ 
tical difference between particles that are described by antisymmetric 
and by symmetric wave functions is that the number of the former type 
that can occupy any state is limited to 0 or 1, whereas any number 
(0,1,2, . . .) of the latter type of particles can occupy any state. 

The treatment of aggregates of large numbers of noninteracting (or 
weakly interacting) particles for which the states can be enumerated in 
these two ways forms the subject matter of quantum statistical mechanics. 
Particles that are described by antisymmetric wave functions are said to 
obey Fermi-Dirac statistics , and particles that are described by symmetric 
wave functions obey Einstein-Bosc statistics .* 

While all the material particles whose statistics arc definitely known 
(electrons, protons, and neutrons) obey Fermi-Dirac statistics, the Ein- 
stein-Bose statistics are not without physical interest. In the first place, 
light quanta or photons, in so far as they can be treated as particles, obey 
Einstein-Bose statistics even though they cannot usefully be described by 
means of wave functions. Second, certain aggregates of particles that 
are sufficiently tightly bound so that they can be regarded as “particles,” 
are described by symmetric wave functions. 

For example, the nucleus of a helium atom is made up of two protons 
and two neutrons, which are strongly bound together. If we consider a 
number of helium nuclei that interact with each other weakly enough so 

1 W. Pauli, Zoita. f . Phyaik, 31, 765 (1925). 

2 See, for example, R. C. Tolinan, “The Principles of Statistical Mechanics,” 
Chap. X (Oxford, New York, 1938). 
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that the changes in the internal motions of the nuclei can be neglected, 
we can see that the motions of the centers of gravity of the nuclei can be 
described approximately by a symmetric wave function. The inter¬ 
change of a pair of helium nuclei can be thought of as the resultant of 
four interchanges: two pairs of protons and two pairs of neutrons. Since 
the actual wave function is antisymmetric in all the protons and in all the 
neutrons, the resultant of these four interchanges leaves the approximate 
wave function unchanged. By an extension of this argument, we see 
that weakly interacting “particles” (nuclei, atoms, or molecules) obey 
Einstein-Bose statistics when each of them consists of an even total 
number of electrons, protons, and neutrons, and obey Fermi-Dirac 
statistics when each consists of an odd total number of these particles. 1 

Collisions of Identical Particles. When the only forces acting on two 
particles result from their mutual interaction, the over-all motion can be 
separated into motion of the center of mass of the two particles and 
motion of the particles relative to each other, as discussed in Secs. 16 and 
18. It is apparent that an interchange of two identical particles does not 
affect the position vector of the center of mass [which is £(r L + r 2 ) since 
the particles have equal masses], but changes the sign of the relative 
position vector r (= ri — r 2 ). We postpone consideration of the spins of 
the particles until the next section, and see now what elTcct symmetry 
or antisymmetry of the space part of the wave function has on the elastic 
scattering of a particle by another that is identical with it. 

The asymptotic form of the unsymmetrized scattering wave function 
in the center-of-mass coordinate system is given by Eq. (18.10). 

u( r)-► <^* + r~ i S(d,4>)c ihT (32.8) 

r—> w 

where r,0,<£ are the polar coordinates of the relative position vector r. 
Since the polar coordinates of the vector — r are r, r — 0, $ + 7r, the 
asymptotic forms of the symmetric and antisymmetric wave functions 
formed from (32.8) are given by 

(a ik * ± e-**) + [/(0,<£) ± /Or — 0, + 7r)]r“V'* r (32.9) 

with upper and lower signs, respectively. 

From the discussion of Sec. 18, it follows that the differential scatter¬ 
ing cross section in the eenter-of-miiss coordinate system is the square of 
the magnitude of the bracket term in (32.9): 

8=5 l/(0 J < fOI 2 + I/Or ” 4> + 7r ) 1“ 

± 27?c[/(0,^/(7r - 0, + tt)] (32.10) 

1 A more rigorous treatment that leads to the same conclusion luis been given by 
P. Ehrenfest and J, It. Oppenheimer, Phys , Rev., 37, 333 (1031). 
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The normalization adopted here can be justified by noticing that in the 
classical limit, where the identical particles are distinguishable and the 
last (interference) term in Eq. (32.10) drops out, <r(0,<£) becomes just 
the sum of the differential cross sections for observation of the incident 
particle (|/(0,4>)| 2 ) and of the struck particle (\f{r — 0, </> + 7r)| 2 ), as it 
should. 

In the usual case, for which f is independent of <£, it is apparent that 
the scattering per unit solid angle is symmetrical about 0 = 90° in the 
center-of-mass coordinate system. It is easily seen from Eq. (18.7) with 
7 = 1 that the scattering per unit angle (not per unit solid angle) in 
the laboratory coordinate system 

<ro(0u) sin 0o = 4 cos 0o sin 0 U {|/(20 n )| 2 

+ |/(t - 20(])| 2 ± 2Ec[/(20o)J(7r - 20 o )]] 

is symmetrical about 0 n = 45°. 

33. SPIN ANGULAR MOMENTUM 

The treatment of identical particles presented in the preceding section 
must now be supplemented by inclusion of the spin angular momenta of 
the particles. It was shown in Sec;. 24 that the operator M, which has 
the properties associated with an angular momentum, can be represented 
by matrices in an infinite number of ways. For each representation, 
M 2 and one component of M, say M Zy can be diagonalized; their eigen¬ 
values arc j(j + 1 )h 2 and the series jh, (j — l)fi, . . . , — jh, respec¬ 
tively, where 2 j is zero or a positive integer. If the expression for M 
in terms of the position and momentum of a particle (r X p) is abandoned, 
then M 2 can commute with the Hamiltonian for that particle. In this 
case, M 2 , and hence j, is a constant of the motion and characterizes the 
particle for all time; the corresponding intrinsic angular momentum is 
called the spin of the particle. We shall replace M by S and j by s in 
dealing with the spin. 

Connection between Spin and Statistics. All the material particles 
whose spin is definitely known (electrons, protons, and neutrons) have 
s = } 2 , Aggregates of particles that are sufficiently tightly bound can 
be regarded as “particles,” and can be characterized by definite magni¬ 
tudes of their total internal angular moment,a, so long as their internal 
motions and the relative spin orientations of their component particles 
art; not significantly affected by the interactions between aggregates. 
This is exactly analogous to the situation with regal’d to the statistics 
obeyed by the aggregates, discussed in the preceding section. 

The treatment of the addition of angular momenta, presented at the 
end of Sec. 2L, can be generalized to give the possible magnitudes of the 
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total internal angular momentum, which we call the spin, of any aggre¬ 
gate of fundamental particles. If the aggregate consists of n particles, 
each of which has s = £, and the internal orbital angular momentum of 
these particles is ignored, the total s can be any integer from 0 to -Jn if n 
is even, or can vary by integer steps from | to if n is odd. The total 
orbital-angular-momentum quantum number can be shown to be an 
integer or zero in general; 1 its inclusion extends the maximum value of s 
for the aggregate, but does not alter the conclusion that 5 is zero or an 
integer if n is even, and is half an odd integer if n is odd. 

We see then that for the known fundamental particles and for aggre¬ 
gates of them that have a definite spin, there is a unique connection 
between the spin and the statistics. Particles or aggregates that have 
zero or integer spin are described by symmetric wave functions and obey 
Einstein-Bose statistics, and particles or aggregates that have half-odd- 
integer spin are described by antisymmetric wave functions and obey 
Fermi-Dirac statistics. There is some theoretical reason, based on 
relativistic quantum mechanics, 2 to believe that this connection also 
holds for other fundamental particles whose existence is suspected but 
whose spin and statistics have not yet been determined (mesons, 
neutrinos). 

Spin Matrices and Eigenfunctions. The spin can be included in the 
formalism developed in Sec. 32 by having each of the numbers 1,2, . . . ,n 
that appear as the arguments of ^ and u represent a spin coordinate as 
well as the three space coordinates of that particle. The spin coordinate 
differs from the space coordinates in that it takes on only 2s + 1 values 
for a particle (or aggregate) of spin s, instead of the infinite number of 
values that are taken on by each space coordinate. Thus the “spin 
space” consists of a finite number of points. The spin wave function of a 
single particle is completely determined by the specification of 2s + 1 
numbers, whereas the space wave function involves the specification of a 
continuously infinite set of numbers (which is equivalent to a continuous 
function of the space coordinates). 

A convenient set of orthonormal one-particle spin functions is pro¬ 
vided by the normalized eigenfunctions of the M 2 and M z matrices given 
in equations (24.15). These eigenfunctions are (2s + l)-row, one-column 
matrices that have zeros in all positions except one. For example, if 
s = f, the four spin eigenfunctions are easily seen to be 

1 The work of Secs. 14 and 24 shows that this is true for nonintcracting particles 
that move in central force fields, and the result turns out not to be affected by particle 
interactions. 

1 W. Pauli, Phys. Rev 58, 716 (1940). 
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and correspond to S z eigenvalues of -|ft, %h, — -J-ft, and —fft, respectively. 
The orthonormality is demonstrated by multiplying the Hermitian 
adjoint of one spin function into itself or another function 



(0 10 0 ) 


0, etc. 


with the help of the usual rule for matrix multiplication. 

Symmetric or antisymmetric many-particle wave functions can be 
constructed from unsymmetrized solutions that include the spin by fol¬ 
lowing the procedure outlined in the preceding section. It is sometimes 
convenient to choose the unsymmetrized solutions to be eigenfunctions 
of the square of the magnitude of the total spin of the identical particles 
(Si + S 2 + * • • + S n ) 2 and of the z component of this total spin 
Si* + /S 2 * + - ' • + Sn*. These quantities are constants of the motion 
if the Hamiltonian does not contain interaction terms between the spins 
and other angular momenta. In addition, such functions are often useful 
as zero-order wave functions when the spin interactions are weak enough 
to be regarded as a perturbation. There is no loss of generality in choos¬ 
ing the unsymmetrized solutions in this way, since in the absence of spin 
interactions any solution can be expressed as a linear combination of total- 
spin eigenfunctions. 

Collisions of Identical Particles. The effect of spin on the collision of 
two identical particles (or aggregates) can now be taken into account if 
the interaction between the particles does not involve the spin. Since 
each particle has 2 s + 1 spin eigenfunctions, there are altogether (2s + l) a 
independent spin functions for the pair, each of which is a product of 
one-particle spin functions. 

Any (2s + l) 2 linearly independent combinations of these products 
can be used in place of them. These are conveniently divided into three 
classes. The first class consists of products of one-particle functions in 
which both particles are in the same spin state with S t value raft: 

tq(ra)?; 2 (ra), —s g m ^ s 

where the subscript specifies which of the particles is in each state; 
there are evidently 2s + 1 such states. The second class consists of 
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sums of products 

vi(m')v 2 ( / ni") + Vi (m // )v a (m / ), m' ^ m ,f 

There are s(2s + 1) of these states. The third class consists of dif¬ 
ferences of products 

vi(m')v 2 (m") — vi(m n )v 2 {m , ) 1 m f m" 

Again there are s(2s + 1) of these. 

The first two classes are clearly symmetric in an interchange of the 
spin coordinates of the two particles, and the third class is antisymmetric 
in such an interchange. Thus the total of (2s + l) 2 states can be divided 
into (s + l)(2s + 1) symmetric and s(2s + 1) antisymmetric states. 
Associated with the symmetric spin states must be a symmetric space 
state if $ is an integer (symmetric total wave function), and an anti¬ 
symmetric space state if s is half an odd integer (antisymmetric total 
wave function). Similarly, the antisymmetric spin states multiply 
antisymmetric space states if 2s is even, and multiply symmetric states 
if 2s is odd. We see then that if all the spin states are equally likely to 
appear in a collision, 1 a fraction (s + l)/(2s + 1) of the collisions will be 
described by the wave function (32.9) with the upper sign, and a fraction 
s/(2s + 1) will be described by (32.9) with the lower sign, if 2s is even. 

This and the similar result for 2s odd can be summarized by rewriting 
Eq. (32.10) 

rW = l/W I 2 + I fir ~ 9)|* + 2 Rc[fm* ~ 0)1 (33.2) 

where f is assumed to be independent of 0. 

Equation (33.2) can also be derived by making use of the earlier 
observation that particles that have different spin components are dis¬ 
tinguishable, in which case the interference term in (32.10) disappears. 
This occurs in a fraction 2s/ (2s + 1) of the collisions. In the remaining 
fraction l/(2s + 1) of the collisions, the particles have the same spin 
component, and the symmetric or antisymmetric space state (upper or 
lower sign in the interference term) must be used according as 2s is even 
or odd. 

Electron Spin Functions. In the remainder of this chapter we con¬ 
sider only electron spin functions (s = -g-). The spin matrices are given 
by the first line of Eq. (24.15), and may be written ns S = %hd 9 where 

'• - C o) '• - C ~o) '■ - (o -?) <*>■»> 


1 See footnote 1, pngo 230. 
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are called the Pauli spin matrices , L The normalized eigenfunctions of S» 
may be written in analogy with Eqs. (33.1) as 



and have eigenvalues ift and — ^ft, respectively; they are both eigenfunc¬ 
tions of S 2 with the same eigenvalue fft 2 . 

Since we shall have occasion to write down products of spin functions 
for different electrons, it is convenient to abbreviate the notation as 
follows: 

t>i(i>a(“i)v»(i)t>4(i) = (H-f-+), etc. 


where the first particle has the eigenvalue for aSi 3 , the second has the 
eigenvalue —for S 2t) etc. Si has no effect on the spin functions of 
any but the first particle. 

The following operational rules are easily obtained from (33.3) and 
(33.4): 


*«(+) = ( — ), <tj/(+) = i(~), 

<T*(“) = ( + ), Cl/(—) = — *( + )i 


Ca( + ) = ( + ) 

= -(-) 


(33.5) 


There are four linearly independent spin functions for a pair of elec¬ 
trons: (++), (H—), (— +), (-). These are orthonormal, since the 

one-particle spin functions (33.4) are orthonormal. As remarked earlier, 
it is often convenient to regroup these functions into combinations that 
are eigenfunctions of (Si + So ) 2 and Su + S 2s . It can be verified with 
the help of (33.5) that the following four combinations are orthonormal 
and have the indicated eigenvalues: 


(Si + S 2 ) 2 Si. + fiu 


( + +) 2ft 2 ft 

2 ■*[(+-) + (-+)] 2ft 2 0 

(--) 2ft 2 -ft 

2 —) ~ (~+)] 0 0 


(33.6) 


It is interesting to note that the first three of the two-particle spin 
functions (33.6) together behave in all respects like a single “particle” 
of spin 8 = 1, and the last of the spin functions (33.6) behaves like a 
single “particle” of spin s = 0. 2 Nob only do they have the proper 
eigenvalues of the square of the magnitude of the total spin and the z 
component of the total spin, but the result of operating on the triplet 
spin function with the x or y components of the total spin is in agreement 

1 W. Pauli, ZcilH. f Physik, 43, 001 (1927). 

2 The first, three states are (willed a triplet and the last a xinylct. In the old quantum 
theory, the triplet corresponds to parallel electron spins and the Hinglet to antiparallel 
sphiB. 





228 


QUANTUM MECHANICS 


[Chap. IX 


with the corresponding matrices in the second line of Eq. (24.15). This 
provides an example of the addition of angular momenta; according to 
Sec. 24, the combination of two systems of angular momenta results in a 
system of angular momentum either 1 or 0. 

The Helium Atom. The ground state of the helium atom was con¬ 
sidered from the point of view of the variation method in Sec. 27. We 
now consider the ground and first excited states of helium with the help 
of the somewhat simpler first-order perturbation theory of Sec. 25; the 
sy mm etry effects of the spins of the two electrons are taken into account, 
although spin-dependent forces are neglected. We use products of 
hydrogenic wave functions u n un (with Z = 2) as the unperturbed eigen¬ 
functions of the problem, and are interested in classifying the states 
according to symmetry and spin properties rather than in obtaining 
accurate energy levels. 

In spectroscopic notation, the ground state of helium is the Is 2 state: 
both electrons are in the hydrogenic state Wioo. Since this space state is 
symmetric, the spin state that multiplies it must be the antisymmetric 
singlet given as the last of the functions (33.6), for which the total spin 
is zero. 

The space part of the first excited state of helium is eightfold degener¬ 
ate in the zero-order approximation. The spectroscopic configurations 
are ls2s and ls2p. Apart from electron exchange, the first state is non¬ 
degenerate and the second is triply degenerate (because of the three 2p 
states); the exchange degeneracy doubles the number of states, since 
either electron can occupy the Is state and the other the 2s or 2p state. 
In order to simplify matters, we consider here only the doubly (exchange) 
degenerate ls2s state; it is not difficult to show that the ls2p states can 
be treated separately (see Prob. 7). 

The perturbation energy is the electrostatic repulsion between the 
electrons e 2 /V 12 , and the unperturbed states are ^loofaO^oofo) and 
Wioo(r 2 )w 2 oo(ri). The spin need not be considered explicitly at this point 
since the spin-dependent forces are neglected; appropriate spin functions 
will be multiplied in later to make the entire wave function antisym¬ 
metric. The matrix of the perturbation for these two states has the 
structure of (25.16) and can be written 


where 



J = 
K = 


j J Wioo(r 1)^200^2) 

j j Wioo(ri)W20o(f2) 


6 2 

— Mioo(ri)i£ 2 oo(r2)drid72 
' 12 

e 2 

— ^100(^2)^200^1)^1^72 


(33.7) 


(33.8) 


J is often called the direct or Coulomb energy , and K the exchange energy . 
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Application of the diagonalization technique of Sec. 25 (see the treat¬ 
ment of the first-order Stark effect in hydrogen) shows that the eigen¬ 
values of the perturbation (33.7) are / + K and J — K) they correspond 
to the normalized eigenfunctions 2~ 4 [t£ion(ri)w 2 oo(r 2 ) + ^ioo(r 2 )w 2 oo(ri)] 
and 2 - *[wi()o(ri)u 2 or>(r2) — Uioo(r 2 )M 2 Qo(ri)], respectively. Since the first 
of these is a symmetidc space function, it must be multiplied by the anti¬ 
symmetric singlet spin function. Similarly the second, which is an anti¬ 
symmetric space function, must be multiplied by one of the symmetric 
spin functions that make up the triplet in (33.6). Since K turns out to be 
positive, the singlet spin state has a substantially higher energy than the 
triplet spin states. This is not due to a spin-dependent interaction, but 
to a coupling between the spins and the electrostatic interaction that is 
introduced by the exclusion principle (use of antisymmetric wave 
functions). 

Spin Functions for Three Electrons. In the treatment of exchange 
scattering from helium that is given in the next section, we shall require 
eigenfunctions of the total spin of three electrons that are analogous to 
those given in Eqs. (33.6) for two electrons. We can regard three elec¬ 
trons as 1 + 2 electrons, in the sense that we can combine an electron 
(s = ■£■) with the triplet two-electron function (s = 1) and with the singlet 
function (s = 0). In the first case, the results on addition of angular 
momenta, given in Sec. 24, show that we should get two groups of spin 
functions for the three electrons that correspond to s = % and s = in 
the second case we should get a single group of three-electron spin func¬ 
tions that correspond to s = We thus expect one quartet group of spin 
states (s = $) and two distinct doublet groups of spin states (s = 4), 
or a total of 4 + 2 + 2 = 8 individual three-electron spin states. These 
must of course be expressible as linear combinations of the 2 3 = 8 
products of one-electron spin functions. 

It is not difficult to show that the following eight combinations are 
orthonormal and have the indicated eigenvalues: 

(S t + S 2 + S ,,) 2 Si* + Sim + £3- 


(+ + +) 

VA* 


3“ 1 [( + + -) + (+-+) + (- + +)] 

yA a 

itL 

3-*[(-h) + (- + -) + (+--)] 

J .fA s 

-» 

(-) 

i/A J 


6“*[(+ + -) + (+-+) - 2(—1—hJ] 

2A 8 

\1t 

6 -i K--+) + (- + -)- 2(H-)] 

w 

-tt 

2-*[(+ + -) ~ ( + -+)] 

W 

\ h 

2-*[(-h) - (- + -)] 

jA a 



The first four (quartet,) states are symmetric in the interchange of any 
pair of particles. The division of the four doublet states into two pairs 
is arbitrary, and is done here in such a way as to make the first pair of 
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doublet states sy mm etric in the interchange of particles 2 and 3, and the 
second doublet pair antisymmetric in 2 and 3. As they are written, the 
doublets have no symmetry with respect to interchanges of the other pairs 
of particles. 


34. REARRANGEMENT COLLISIONS 

Cross sections for elastic and inelastic collisions of electrons with 
hydrogen atoms were calculated in Sec. 30 by means of perturbation 
theory, under the assumption that exchange of the incident and atomic 
electrons can be neglected. In this section, we consider the effects of 
electron exchange taken together with spin and the exclusion principle, 
but continue to use perturbation theory, which is most useful for high- 
energy collisions. 1 We first consider a general rearrangement collision 
by means of the Born approximation of Sec. 26, then show the connection 
between this method and the time-dependent perturbation theory of 
Sec. 29, and finally apply the theory to exchange collisions of electrons 
with hydrogen and helium atoms. 

Notation for Rearrangement Collisions. A general binary rearrange¬ 
ment collision can be described as an event in which a system A in state 
m collides with a system B in state n, and systems C in state s and D in 
state t emerge. It is assumed that the same particles make up the sys¬ 
tems A } B as make up the systems C,D (no particles appear or disappear 
and no photons are involved), although the particles are rearranged dur¬ 
ing the collision. We use the letters to denote all the internal 

coordinates (including spins) of the respective systems, r^ and x cd to 
denote the vectors that connect the centers of mass of the systems 

A,B and C,D, respectively, and and ~ 

to denote the reduced masses associated with the relative motion before 
and after the collision. The entire calculation is performed in the center- 
of-mass system; the transformation to the laboratory system can be 
effected by means of the general results of Sec. 18. 

It was shown in Sec. 32 that a calculation of this type can be carried 
through as though the particles are distinguishable. At the end, a linear 
combination of the exchange-degenerate wave functions is formed that 
has the proper symmetry in each group of identical particles. The sym- 
metrization will be left for the specific examples given near the end of this 
section. For the general problem considered here, we obtain only an 
approximate unsymmetrized wave function. 

1 For a discussion of other methods applicable to lower energy collisions, see N. F. 
Mott and H. S. W. Massey, “The Theory of Atomic Collisions” (Oxford, New York, 
1933). 
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We wish to solve the wave equation 

(H - E)t = 0 (34.1) 

where the Hamiltonian can be written in either of two ways: 

H = H ab + HU = H ed + H'^ (34.2) 

The unperturbed Hamiltonians for the initial and final systems are 


H* = H a + H b + Tab, 


Ha„ = Ha + Hi + Tat, 



—— v 2 k 

2Mab ah 
2 M ed td 


(34.3) 


where the T’h represent the kinetic energy operators for the relative 
motions in the center-of-mass system. The unperturbed states of the 
initial and final systems are (known) solutions of the wave equations 

(Ha “ E am )Uam(A) = 0, (H b - E hn )u bn (B) = 0 .v 

C He - E 0 „)u 0 a(C) = 0, (H d - E dt )u dt (D) = 0 

The interaction terms H ^ and H' cd are regarded as small perturbations. 

It is always possible to expand the exact solution in the complete 
orthonormal set of functions ?/ CH (C r )7/, di (D), where the expansion coef¬ 
ficients are functions of the relative coordinate r^: 


4* — ^ Uoa(O^Udt(E)^at(^c<{) 

ttjt 


(34.5) 


Our problem consists in finding approximate expressions for the functions 
v B t(*cd) that correspond to internal final states .s and t for the systems C 
and D, and that arise from the unperturbed initial state 


^ ( , = u ttm (A)it bn (B) exp (tk 0 ■ r^) 
2 M ah 


k\ - 


A s 


(E - E ntn - Ehn) 


(34.6) 


Use of the Bom Approximation. Substitution of yf/ from Eq. (34.5) 
into the wave equation (31.1) yields, with the help of (3*1.2), (3*1.3), and 
(34.4) 

£ Hr.{C)u dt {l))(T cd + li r . + Mu - li)v Hl {r,i) = -//^ (34.7) 

If now Eq. (34.7) is multiplied through on the left, by Uca'(C)u<u f (D) and 
integrated over all the coordinates of C and I), the orthonormality of the 
u ’s causes all the terms on the left side to vanish except that for which 
s = s' and t = //. We drop the primes and write this as 

(Trd + Era + ~ E)vMrd) = ~ J 0 **&) H^M* (34.8) 
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Equation (34.8) can be written in a form that is analogous to Eq. (26.4): 

(-V*, - £ 2 Hi(r«i) = “ / / u c .{C)u it {D)H' cd Hr c dr d 

IM, 


* 2 = ^ (E - E" - E dt ) 


(34.9) 


Equation (34.9) with all s and t represents a sequence of exact equa¬ 
tions that can in principle be solved for the functions v, t . This situation 
is similar to that encountered in Eq. (26.4), where we obtained an approxi¬ 
mate solution by replacing the exact by the unperturbed solution on the 
right side. We now obtain an approximate solution of (34.9) by replac¬ 
ing -p by the 1^0 given in (34.6); then the right side is known, and the 
inhomogeneous equation is readily solved for v, t by means of an appropri¬ 
ate Green’s function. The substitution of p'a for is equivalent to the 
assumption that there is very little interaction between the unperturbed 
initial systems A and B. This implies not only that the transition 
—*■ C,I) has a small probability, but also that \j/a is a good approxi¬ 
mation to the actual wave function even when the two systems .1 and B 
are close together or overlap. In practical cases, it is difficult to set up a 
workable criterion for the validity of this approximation, although useful 
results are likely to be obtained when E is large in comparison with all 
the interaction energies that appear in H'^. 

With the help of the Green’s function (26.15), the solution of the 
inhomogeneous equation (34.9), with + replaced by becomes 

«'.<(ri C i) = ~ J J J 14 - *cd\~ l [exp (*|4 - r rrf |)] 

• u c ,(C)u dt (D)H' cd u am (A)u bn (B)[exp (ik 0 • r ah )]d.TjTdTc,i (34.10) 

The integration in (34.10) is overall the unprimed coordinates; the 
element of integration can be represented either as dr^hudr^ or as 
dTodTbdTri, and is abbreviated in what follows as dr. 

The asymptotic form of (34.10) when systems C and D are well 
separated is 


v »t( r cd) * 9at(Q,<l>)T r c d 1 C* Ar, « i 

r'ed —» eo 

c 

9>t{0,4>) = - <^2 J u cs(0)u dt (D)[exp (~ik-i cd )] (34.11) 

• ^w aB ,(4.)w 6 „(B)[exp (fk 0 • r a <>)]cfr 

th6 F°l ar 6116168 0f tLe vector 4, and k is a vector that 

^ i° n T d u 6 magnitude ® ven b y Eq. (34.9). Equation 
( . ) is normalized so that the incident flux of systems A and B is the 
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initial relative speed »o = hho/Mab, and Eqs. (34.5) and (34.11) are 
normalized so that the radial outgoing flux of systems C and D per unit 
solid angle is v\g, t (6,<l>)\ i , where the final relative speed v = hk/M ed . 
Thus the differential cross section for the collision A,B —> C,D is 

<r-(M) = ~ MM)I 2 (34.12) 

vq 

Lack of Orthogonality of Initial and Final States. There is an arbitrar¬ 
iness in the expression (34.12) for the cross section that arises from the 
fact that the wave function for the initial state is not in general ortho¬ 
gonal to the function ^7 = u oa (C)u dt (D) exp (zk • r^), the complex con¬ 
jugate of which also appears in the expression for niay be 

said to describe a final state in which the systems C and D are observed 
to be moving in the direction 0 ,<£. Since the initial and final wave func¬ 
tions are eigenfunctions of different unperturbed Hamiltonians H a i and 
H ed) respectively, they are not expected to be orthogonal to each other. 
If they are not orthogonal, the addition of a constant potential energy 
(which corresponds to zero force) to H' cd alters the expression for 
such a change in H' cd could be made by adding an arbitrary constant mul¬ 
tiple of ^ to both sides of Eq. (34.7), thus also changing the magnitude of 
E , We avoid this arbitrariness by defining H' od as the energy of inter¬ 
action between systems C and D that vanishes as r ed becomes infinite; 
the additive constant is then fixed uniquely. A similar definition is 
made for H 

It is interesting to note that H f ah can be used in place of H ^ in the 
integral for g 8t ( 0 ,<£). This integral is with the help of Eq. 

( 22 . 10 ) it can be transformed as follows: 

W/H'^dr = 

= j[{II — Hc<i)'h\ x l'ud T = 

where we have made use of the relation H od ^f = E&j- In similar fashion, 
can be shown to bo equal to the last expression by making use 
of the relation H (l b^n = Ei/'o. We see then that 

ShUMr = ShlfLMr (34.13) 

Equation (34.13) has, for example, the consequence that = 0 

if H f a 5 = 0 , even if H f cd is not zero and the initial and final states are not 
orthogonal. This result is to be expected, since if /7 a & = 0, there is no 
interaction between the colliding systems A and B and the transition 
does not occur. 

Connection with Time-dependent Perturbation Theory. It is also 
possible to derive Eq. (34.12) by the method of variation of constants 
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(Sec 29) We expand the wave function + in unperturbed final-state 
wave functions h - u e .(C)u dt (D)ex p(& • i*), where the subscript / 
stands for the states s and t of systems C and D and for the relative 
propagation vector k: 

\[/ = ^ a f (t)\l/ f e A (34.14) 


The time-dependent wave equation is 


ih d 4 - H+ = (ff* + HU)* 


(34.151 


Substitution of (34.14) into (34.15) gives 

^ ^ ft TT\ 

ih ) dft/e * = £ o-{H^fe 


iEft 

h 


( 34 . 10 ) 


where use is made of the relation (Hod — E/)^/ — 0. Equation (34.16) 
can be simplified by multiplying through on the left by #/' and int egrating 
over all the coordinates; since the \[/ f are orthonormal, wo obt ain 


i(E/ r — Ef)l 

fyf'H'd'I'fdr ’ c h ( 34 . 17 ) 

/ 

The system of Eqs. (34.17) is exact. We now make two approxima¬ 
tions, which together are equivalent to the Born approximation Hiib- 
stitution of for on the right side of Eq. (34.9). First, wo assume* t hat 
the perturbation H' d is small; because of Eq. (34.13), this is equivalent 
for our purpose to the Born approximation assumption that II' h is small. 
Then we can insert the unperturbed amplitudes on the rigid. side of 

(34.17) , and calculate the first-order perturbed amplitudes a 1 }? on the 

left side. Second, we assume that the initial state fae h can be expanded 
in terms of only those (degenerate) 1/7 whose energies Ef are equal to the 
initial energy E. This assumes that is an eigenfunction of the* final 
unperturbed Hamiltonian H cd , which is equal to H uh + ll { [ h - IIU\ 
since $ 0 is actually an eigenfunction of H ab , this also is equivalent to the 
assumption that the perturbations H ^ and H' d are small. 

We can then replace Ef by E in the time factor on the right- side of Eq. 

(34.17) and take this factor outside of the summation over/. The 1111 per- 

turbed amplitudes a ( f are defined by V-o = ^ which gives 

= ftfiadr 
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With, the help of (34.18), the summation over/can be rewritten 

, ^ f h'Htd'I'fdr J Modr = J ^ r H' d \f/ 0 dr 

f 

where use has been made of the closure relation for the complete ortho¬ 
normal set of functions \f/ f . Thus Eq. (34.17) becomes, to first order, 

iha { }) = fyf'H^odr - e h (34.19) 

Equation (34.19) can be handled in precisely the same way as Eq. 
(29.7), and yields the differential collision cross section (34.12). 

Exchange Collisions of Electrons with Hydrogen. As a simple first 
example of a rearrangement collision in which effects of identity and spin 
appear, we consider the elastic scattering of an electron from a hydrogen 
atom. In a problem of this type, we must know the asymptotic forms of 
the unsymmetrized wave function for all permutations of identical par¬ 
ticles. 1 A wave function that has the proper symmetiy character can 
then be constructed by the methods outlined in Sec. 32. We first obtain 
the asymptotic form of the wave function when the incident electron is 
scattered and when the incident electron exchanges with the atomic elec¬ 
tron, to the accuracy of the Born approximation; spin-dependent inter¬ 
actions are neglected. 

The incident and atomic electrons are denoted by 1 and 2, respec¬ 
tively, in the unsymmetrized wave function. The asymptotic form of 
the stationary wave function ^(ri,ro) that corresponds to nonexchange 
elastic scattering with the total energy E is a product of the ground- 
state hydrogen wave function ^ioo(r 2 ) for electron 2, and an incident- 
plane plus outgoing-scattered wave for electron 1: 

^(r!,r 2 )-> [exp (*k„ • r,) + rr^^'/WOlttioflfr*) 

h-kl _ me* ( 34 - 2 °) 

2 m + 2ft 2 

The work of Bees. 20 and 30 shows that the scattered amplitude has the 
form 

g(6l) = " W f / r ° xp ( ~ * • r ')]“- 1( .(r 2 ) 

■ (exp ik o • ri)wi,H,(r 2 )dridTa (34.21) 

where k is a vector of magnitude /f« that has the direction 6 1 (/ does not 
depend on the other polar angle <£i). 

1 J. R. Oppenheimer, Phya. Rev., 32, 361 (1028). 
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The asymptotic form of ^(ri,r s ) that corresponds to elastic exchange 
scattering is a product of the ground-state hydrogen wave function ■Mioo(ri) 
for electron 1, and an outgoing scattered wave for electron 2. 

lKri,r 2 )-*■ rj 1 e ik ' r 'g(6 i )u 10 o(ri) (34.22) 

ra—* « 

There is no plane wave in this case, since electron 2 is not incident on the 
atom. In accordance with Eq. (34.11), the exchange scattered amplitude 
is 

g{ei) = ~ 2S* / / t ex P(~* * - £) 

• (exp &o • ri) v> ioo(^a) dridr 2 (34.23) 

where k has the magnitude k 0 and is in the direction 0 2 . 

We must now form an antisymmetric wave function from products of 
^(ri,r 2 ) and appropriate spin functions. The spin functions can be taken 
to be the set of four given after Eq. (33.5); however, it is simpler to make 
use of the four symmetrized combinations (33.6). The spin of the inci¬ 
dent electron is not assumed to have any definite relation to the spin of 
the atomic electron. In this case we can use either of these sets of spin 
functions, calculate the scattering with each of the four spin states of a 
set, and then average the results with equal weights for each state. 1 The 
first three of the spin functions (33.6) are symmetric, and must be multi¬ 
plied by the antisymmetric space function ^(ri,r 2 ) — ^(r 2 ,ri); the fourth 
spin function is antisymmetric, and must be multiplied by 


^(ri,r 2 ) 4- ^(r 2 ,rO. 


Differential Cross Section. The asymptotic forms of the symme¬ 
trized space functions for large values of one of the electron coordinates, 
say r h are obtained from (34.20) and (34.22). 


1 This is a consequence of the fundamental hypothesis of quantum statistical 
mechanics; see, for example, R. C. Tolman, “The Principles of Statistical Mechanics, 
Sec. 84 (Oxford, New York, 1938). It can be shown that either of two complete 
orthonormal sets of wave functions can be used in such a statistical calculation (in the 
present problem, the two sets are complete so far as the spins of two electrons are 
concerned). The two sets, say v n and Uh, were shown in Sec. 22 to be connected by a 


unitary transformation: v n — where S is a unitary matrix. Then 

k 

^Kl 2 = y 8I^tJSro'nUk^iJ B , = y SklfUbilk' 53 
» nfok’ kjc f k 


Since the probability of observing a given event (such as the scattering in a particular 
direction) is proportional to the square of a wave function, the same average result is 
obtained from a statistical mixture of either set of wave functions. 
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^(ri,r a ) ± ^(r 2 ,ri) 

-► [exp (*k 0 ■ rO + i) ± 7- 1 e*‘^(0i)]w 1(00 (r 2 ) (34.24) 

n—►» 

The first two terms in the bracket on the right side of (34.24) come from 
the first term on the left side, and the third term on the right side comes 
from the second term on the left side. The differential cross section must 
be computed with the upper sign in one quarter of the collisions, and with 
the lower sign in three quarters of the cases. We thus obtain 

*(«) = *i m + m*+ to) - m* (34.25) 

Equation (34.25) can also be derived without explicit reference to 
the spin wave functions, as was Eq. (33.2), by making use of the earlier 
observation that particles that have different spin components are 
distinguishable. In half the collisions, the electrons have different spin 
components, and the cross section is just the sum |/(0)| 2 + \g(9)\ 2 of the 
direct and exchange cross sections; in the other half, the electrons are 
indistinguishable, and the antisymmetric space function must be used. 
We thus obtain 

*(«) = *(!/(*)I 2 + 1(7(0)I 2 ) + *1/(0) - (7(0)I 2 

which is easily seen to be the same as (34.25). 

An integral of the form /(exp ik ■ T)F{x)dr is small if 7m » 1, where it 
is assumed that F is a smooth function of r that becomes small for r > a. 
Since both the iq and r 2 integrals in (34.23) are of this type, we expect 
g to be quite small in comparison with/for ka o » 1. This is the situation 
in which the Horn approximation is most applicable, so that the correc¬ 
tions due to exchange are expected to be fairly small for the cross sections 
calculated in Sec. 30. 

Exchange Collisions with Helium. In dealing with the elastic scatter¬ 
ing of an electron from a helium atom in its ground state, it is convenient 
to work with the space and spin wave functions together. According 
to the discussion of Sec, 33, the two electrons in the helium atom are in a 
symmetric space state and an antisymmetric (singlet) spin state. Thus 
if the incident electron is denoted by 1 and the atomic electrons by 2 and 
3, the unperturbed wave function is (exp *k 0 ’ ri)'W(i(r 2 ,r 3 )y(l,2,3), where 
Mo is the symmetric space function for the normal state, and v(l,2,3) 
is a spin function that is antisymmetric in 2 and 3. The eight spin 
functions for three electrons are grouped in (33,0) according to symmetry 
in 2 and 3; it is apparent that v(l,2,3) must be one of the last doublet pair 
given there. 

The asymptotic forms of the first-order perturbed wave function 
including spin are found to be 
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^(1,2,3)-> [(exp tk 0 • ri) + ?T 1 /(^i)] M o(r2,r s )«(l,2 ) 3) 

ri—► oo 

—■+ »?V(®*)Mo(r8^0»(2,M) (34-26) 

rs~» oo 

-**7V(0>)«o(ri,rs)»(3,l,2) 

f3-► 80 

where there is antisymmetry only in electrons 2 and 3. Here 
f(9i) = -gS* / / / ^xp (—ik • ri)]fio(r a ,r 3 ) 

• ( -—H -— J (exp zk 0 * ri)Wo(r 2 ,r s )dTidr 2 drj 

v 12 7*18 ^1 / 

and a spin term v*(l,2,3)v(l,2,3) = 1 has been omitted. Also 
if (9 2 ) = (7(^>*(2,3,1M1,2,3) 5 

= ~ 2^2 /// [ ex P (“* • r «)K(r 3 ,ri) (34.27) 

/ e a e 2 2e 2 \ 

* (-1-) (exp&o * ri)tto(r2,r 3 )r/Tidr 2 rfT3 

V 12 r 28 r 2 / 

with a similar expression for The product of spin fimctions in 

(34.27) is readily evaluated by making use of the one-electron functions 
(33.4) and remembering that v* is the Hermitian adjoint of v* We take 
for u(l,2,3) the next to the last spin function of (33.9), and obtain 

V *(2,3,1M1,2,3) = 2-*[(~ + +)* - (+ + -)*] 

.*-*[(++-) -(+-+)]= -* (34.28) 

The completely antisymmetric wave function derived from *(1,2,3) 
is given in Eq. (32.3) with the lower sign. Since * is already antisym¬ 
metric in its last two arguments, it is apparent that the second bracket- 
terms in (32.3) duplicate the first bracket terms. The asymptotic form 
of the wave function for large values of one of the electron coordinates, 
say r h is then obtained from (34.26) and (34.28); 

*(1,2,3) + *(2,3,1) + *(3,1,2) —-♦ 

{(exp *k 0 ■ rO + r r VM/(00 -lg(0 1 ) ~ 4flf(«i)])tto(r.*)Kl,2,3) (34.29) 

The differential cross section obtained from (34.29) is 

= I m - 9W (34.30) 

Like Eqs. (33.2) and (34.26), Eq. (34.30) can be derived without 
explicit reference to the spin functions. Since the two atomic electrons 
must have antiparallel spins (singlet state) in order for the helium atom 
to be in its ground state, the spin component of the incident electron 
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is the same as that of one of the atomic electrons and different from 
that of the other. It cannot exchange with the latter in an elastic 
collision, since then both the resulting atomic electrons would be in the 
same spin state and the exclusion principle would force the atom into an 
excited state. Thus it can only exchange with the electron with which 
it is indistinguishable, so that the antisymmetric combination of direct 
(/) and exchange (g) amplitudes must be used; this gives Eq. (34.30). 

In the absence of spin-dependent interactions, the excitation of a 
triplet state of helium by electron impact can be accomplished only by 
exchange between the incident electron and one of the atomic electrons. 
In this case, there is no direct (/) amplitude, and hence no interference 
between direct and exchange amplitudes. 

Problems 

1. Show that the antisymmetric wave function given in Eq. (32.7) vanishes if 
there is an identical linear relation between the functions v a , vp, . . . , v v . 

2. Show that if a wave function w(l,2, ... n) is nil energy eigenfunction of a 
symmetric Hamiltonian that corresponds to a nondegenerate eigenvalue, it is either 
symmetric or antisymmetric. Show this first for n =■ 2, then for n « 3, and then 
indicate how the proof can ho extended to arbitrary n. 

3. Verify that the spin wave functions given in Eq. (33.6) are eigenfunctions of 
(Si + S-j) 2 and S\ t with the indicated eigenvalues. Show also that the result of 
operating on these functions with the x and // components of the total spin is in agree¬ 
ment with the appropriate matrices given in Eqs. (24.15). 

4. Carry through the calculations of Prob. 3 for the spin functions given in Eq. 
(33.9). 

5. Find the eigenfunctions of the square of the magnitude of the total spin and 
the z component of the total spin of four electrons, and show that they can be grouped 
into one quintet, three triplet, and two singlet states. (Hint: Start with the triplet 
and singlet spin functions Tor two pairs of two electrons, and make us('. of the matrices 
(24.15) together with the corresponding set for j = 2.) 

6. Use Eq, (33.2) to write down an expression for the scattering of protons in the 
center-of-nuiss coordinate system, assuming that the Coulomb interaction extends in 
to r « 0. Discuss the classical limit of the cross section (h ->0), particularly in the 
neighborhood of 0 = 90°, and show that the interference term drops out. if the average 
scattering over an arbitrarily small Imt. finite range of angle is computed. 

7. Show that the 1 «2p configurations in helium can be treated separately from 
the 1«2« configurations so far as the first-order energy-level calculation of Sec. 33 is 
concerned. 

8. What would be the unperturbed ground-state wave functions of helium if each 
electron had spin angular momentum A and obeyed Einsloin-Hose statistics? 

9. Write down the unperturbed ground-stale wave function for a neutral lithium 
atom. 

10. Show by direct calculation that Kq. (3-1.25) is obtained if the incident and 
atomic electrons arc assumed to be described by the four spin wave functions ( + 4-), 

(H—), (—H, and (-), rather than by the triplet and singlet combinations (see 

footnote 1, page 236). 



CHAPTER X 

SEMICLASSICAL TREATMENT OF RADIATION 


No account has thus far been given in this book of the interaction 
between material particles and electromagnetic radiation. As would be 
expected, a treatment that is consistent with the foregoing theoxy of 
material particles requires that quantum equations of motion of the 
electromagnetic field be found that are analogous to Maxwell’s equations. 
Indeed, it is only in this way that Planck’s original quantum hypothesis 
can be fitted into a general theoretical framework. The development of 
the elements of a quantum theory of radiation will be postponed until 
Chap. XIV. In the present chapter we treat the electromagnetic field 
classically and the particles with which the field interacts by quantum 
mechanics. Such a semiclassical treatment is bound to be incomplete 
and not wholly satisfactory, although it is simpler in principle than the 
quantum electrodynamics presented in Chap. XIV. We shall find that 
it is possible in this approximate way to give a plausible and correct 
account of the influence of an external radiation field on a system of 
particles (absorption and induced emission), but not of the influence of 
the particles on the field (spontaneous emission). Nevertheless, the 
results of the classical treatment of the latter phenomenon can be con¬ 
verted to quantum theory in a correct, if not very convincing, manner. 
Some simple applications of the theory are given in Sec. 37. 


36. ABSORPTION AND INDUCED EMISSION 
The Schrtidinger wave equation for the motion of a particle of mass m 
and charge e in an electromagnetic field described by the potentials 
A> <t>, with an additional potential energy V, is obtained by adding a tei'm 
7^ to the right side of Eq. (23.24). 


•* W f 


v 2 + — A ■ 

2m ^ me 


grad + m (div A) 


+ w A2+e *+ F k t 36 - 1 ) 


We regard V as the potential energy that binds the particle (of electro¬ 
static origin if the particle is an electron); A,<£ represent an electromag¬ 
netic field that is weak enough so that those terms can be regarded as a 
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perturbation. Our object is to calculate the probabilities of transitions 
between stationary states of the particle in the potential energy V , that 
are produced by the field. We first discuss some properties of the field 
and its plane wave solutions. 

Maxwell’s Equations. Maxwell's equations of motion for the elec¬ 
tromagnetic field are, in Gaussian units, 


c , 1 dH n 

curlE + - c -dt =0 ’ 


curl H — - ^ — J 

c dt c * 


div E — Airp, div H = 0 


(35.2) 


If the divergence of the second of these equations is combined with the 
time derivative of the third, we obtain the equation of continuity for the 
electric charge and current densities p and J 


divJ+l-0 


(36.3) 


The electric and magnetic field strengths can be expressed in terms of the 
potentials by Eqs. (23.15). 

1 r) A 

E = - - - grad 4 , H = curl A (36.4) 

C at 


which cause the first and fourth of Eqs. (35.2) to be satisfied identically. 
The potentials are not defined uniquely by Eqs. (35.4), since any A,<£ that 
give the correct E and H can evidently be replaced by new potentials 
A',<£' without altering the field strengths, where 

A' = A + grad x, ^ = ^ (35.5) 


and x ia an arbitrary function of r and t. 

Substitution of (35.4) into the second and third of Eqs. (35.2) gives 


, , A , 1 3 2 A . 1 , c)</> 4tt T 

curl curl A + + -grad — = — J 


op 

i “ div A + V 2 <j> = — 47r/o 
c at 


(35.6) 


If the vector A is written in rectangular coordinai.es, we can put 
curl curl A = grad (div A) — V 2 A, 

where the last term is the vector whose components are the Laplacians of 
the components of A. We can therefore simplify Eqs. (36.6) by making 
a gauge transformation (35.5) from A,</> to A',4>' such that the new poton- 
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tials satisfy the Lorentz condition: 

div A' + i ~ = 0 
c at 

The gauge function x then satisfies the equation 

V 2 X -I*!* = — (div A + - —) 
V x c 2 dU \ Q1V A r c dt) 

Equations (36.6) then become 


w _ i 

c 2 dt 2 
v c 2 dt 2 


= —47rp 


(36.7) 


(35.8) 


(35.9) 


Plane Electromagnetic Waves. If J = 0 and p = 0 (completely 
empty space), it can be shown that it is possible to choose the gauge 
function so that div A' = 0 and <f>' = 0 for all r and /, without loss of 
generality (see Prob. 1). Then transverse plane wave solutions can be 
found for A', and hence also for E and H. We drop the primes and have 
in this case 

v 2A- 4 2 ^ = °> divA = ° ( 36 - i() ) 


A typical plane wave solution of (35.10) is one that represents a real 
potential with the propagation vector k and the real polarization vector 
|A 0 |: 

A(r ,t) = 2|Ao| cos (k • r — ut + a) 

= A 0 [exp i(k • r — ut)] + c.c. (36.11) 


Here “c.c.” denotes the complex conjugate of the term that precedes it, 
and the constant complex vector A 0 is defined to be \A 0 \e ia . The first of 
Eqs. (35.10) is satisfied if w = kc , where k is the magnitude of k, and the 
second is satisfied if A 0 is perpendicular to k. 

The electric and magnetic fields associated with the vector potential 
(35.11) are 

E = —2/r|A 0 | sin (k • r - at + a) 

H = —2k X |A 0 | sin (k • r — at + a) 

The Poynting vector (c/47r)E X H is evidently in the direction of k; its 
magnitude averaged over a period 2?r/a) of the oscillation is 
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where |A 0 | 2 is equal to the scalar product of |A 0 | with itself (|A 0 | • |A 0 |) or 
the scalar product of A 0 and its complex conjugate (A 0 • Ao). Equation 
(35.12) is the intensity associated with the plane wave (35.11). 

Use of Perturbation Theory. We now return to Eq. (35.1), and use 
it to calculate the probability of a transition between stationary states 
that is produced by the vector potential (35.11), which is regarded as a 
small perturbation. The third term (div A) and fifth term (<£) on the 
right side of (35.1) are now zero. The ratios of the second to the first 
term and the fourth to the second term on the right side of (35.1) are of 
order eA/cp ) where p is the momentum of the particle. The magnitude 
of this quantity is estimated in a practical case in Prob. 2, and is so small 
that the perturbation approximation is justified. Thus to the first order 
of perturbation theory, we can neglect the term e 2 A‘ 2 /2me 2 , and rewrite 
Eq. (35.1) 

ih *£ = (H 0 + H')t 

h* ieh ( 35 ‘ 13 > 

Hn = - f- V* + 7(r), IV - — A • grad 
2 m K n me * 


We proceed as in Sec. 29, and expand ^ in stationary eigenfunctions 
w*(r) of the unperturbed Hamiltonian II ( > with time-dependent coeffi¬ 
cients a,k(t). If the system is initially in the state n and the perturbation 
is turned on at t = 0, the first-order amplitudes at time t are given by an 
expression similar to (29.17). 


JI ^0 fj/(w£«—t*))i — "1^ ^ 


aVV) = - -- 

n Oi/nt — 0 ) 


mi 


k Win + CO 

= — / tfrtexp 2 k • r)A 0 • grad u n dr 
me J 

flkn = ^ J «*[** p C-fk • r)]Au • grad u n th 


(35.14) 


As discussed in Sec. 29, the probability of finding the system in the 
state k is appreciable only when the denominator of one or the other of 
the two terms in (35.11) is practically zero. There is no interference 
between the two terms: the first is important when E k ^ E n + ftu, and 
the second is important when E k = 2£» — fiw. Thus the probability of 
finding the system in a state k that has an energy higher than the initial 
state by about hu is proportional to and the probability of finding 

the system in a state k' that has a correspondingly lower energy is pro¬ 
portional to 

Transition Probability. The discussion of Sec. 29 shows that the 
transition probability per unit time is independent of the time only if 
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the final state can be any of a very closely spaced or continuously dis¬ 
tributed group. The need for a group of final states arises from the 
dependence of the probability (i)| 2 on the energy, which is shown in 
Fig. 27; it is the area under this curve, not the ordinate at a particular 
abscissa, that is proportional to t. 

In the same way, a constant transition probability per unit time is 
obtained in the present problem if the incident radiation is monochro¬ 
matic (definite value of w) and transitions can occur to any of a group of 
closely spaced or continuously distributed final states. The result is 
Eq. (29.12) with either H'& or 2T#° substituted for H ' km . However, the 
computation of a transition probability between two discrete states is 
often of interest. In this case, the transition probability per unit time 
is not constant in time, if the incident radiation is strictly monochromatic, 
and depends markedly on the difference between w and 

|wjb„| = | E h — E n \/h. 

What we do in this case is to assume that the radiation covers a spread of 
frequencies with no phase relations between the different frequency 
components, so that the radiation can be characterized by an intensity 
per unit frequency range that is constant in the neighborhood of \ukn\- 1 

The probability of finding the system in the final state is then propor¬ 
tional to iJTJfcnl 2 or which in turn is proportional to |A 0 | 2 and hence 

to the intensity. If the intensity in the small angular frequency range 
Aw is J(w) Aw, Eq. (35.12) tells us that we can put 

|A 0 | 2 = ^/(co)Aw (35.15) 

where A 0 is the vector potential amplitude that characterizes the fre¬ 
quency range Aw. The probability that a transition in which the system 
is left in a higher energy state ( E k ^E n + hco) has taken place at the 
time t is then 


LdjMii _ V sin 2 £(«*» - u)t 
1 * wi 4 hKwkn ~ w)2 




Ui(exp tk • r) grad^ndr 


. «)* (35.16) 

(W*n ~ «) 


1 For a discussion of the situation in which the intensity is not constant near |wjbn|. 
see W. Heitler, “The Quantum Theory of Radiation,” 2d ed., Sec. 15 (Oxford, New 
York, 1944). 
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where gracL is the component of the gradient operator along the polariza¬ 
tion vector Ao. The contributions to the probability from various fre¬ 
quency ranges are additive, since there are no phase relations between the 
radiation components of different frequencies. 

Each frequency range Aa> in (35.16) can be made infinitesimally small, 
and the summation replaced by an integration. Since the time factor 
has a sharp maximum at co = co^n, the other factors that involve co can 
be taken outside the integral and the limits on 6) extended to ± oo t as was 
done in going from Eq. (29.10) to (29.11). Thus the transition prob¬ 
ability per unit time for an upward transition becomes 


■/: 


J - * 0\OJkn 

4 7r 2 e 2 | r 

= m 2/. 2 / Uki.6x.-p ik ■ r) grad^t u n dr 

m L03 kn | J 


i2*(exp ik • r) gracL u, 

l 

sin 2 — ci>)t j 

~ - a ,) 2 


(35.17) 


where the magnitude of k is now m n /c. An expression very similar to 
(35.17) is obtained for the probability per unit time of a downward 
transition (AV = E n — hu ): 


4ir 2 c 3 

m 2 ^ 


/ 


/(ow) / 'zVfexp (—ik • r)] grad A u n dr\ 


(35.18) 


In this case the magnitude of k is oj„aV c - 

Interpretation in Terms of Absorption and Emission. Equations 
(35.17) and (35.18) give probabilities per unit time for transitions of the 
particle between stationary states under the influence of a classical 
radiation field. These expressions can now be interpreted in terms of 
absorption and emission of quanta of electromagnetic radiation. It is 
necessary to assume that such quanta exist and provide the energy units 
of the radiation field, and that energy is conserved between field and 
particle. The particle gains the amount of energy E k — E n in an upward 
transition under the influence of radiation of angular frequency co*„. The 
quantum energy of this radiation is hu kn = E h — E n , ho that it is reason¬ 
able to associate with the upward transition of the particle the absorption 
of one quantum from the radiation field. 

In similar fashion the downward transition is associated with the 
emission of one quantum whose energy corresponds to the frequency of 
the radiation field. In accordance with Eq. (35.18), the emission prob¬ 
ability is proportional to the intensity of the radiation present. This 
process is therefore referred to as induced emission . 
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It is sometimes convenient to rewrite Eq. (35.18) in terms of the 
reverse transition to that which appears in (36.17). Equation (35.17) 
describes the transition from an initial lower state ntoa final upper state 
k; (35.18) can be made to describe the transition from an initial upper 
state A* to a final lower state n, if n is replaced by k and k' by n . Then 
(35.18) becomes 


4rr 2 e 2 

m*co>l n 


/(«*») 


I* 


tf n [exp (—ik ■ r)] grad A u k dr\ 


(35.19) 


We can now show that the integral in (35.19) is just minus the complex 
conjugate of the integral in (35.17). By means of a partial integration, 
or with the help of Eq. (22.10), the integral in (35.19) is seen to be equal 
to 1 

— 5u h grad A [u n exp (—ik • r)]dr 


Since only the component of the gradient along the polarization vector Ao 
appears, and this direction is perpendicular to the ^propagation vector k, 
the operator grad A does not affect exp (—ik • r). Thus the integral in 
(35.19) is equal to 


— Ju*[exp ( — ik ■ r)] grad A u n dr 

and the square of its magnitude is equal to the square of the magnitude 
of the integral that appears in (35.17). 

Since (35.17) and (35.19) are the same, the probabilities of reverse 
transitions between any pair of states under the influence of the same 
radiation field are equal. 

Electric-dipole Transitions. In most cases of practical interest, the 
wave length of the radiation is many times greater than the linear dimen¬ 
sions of the wave functions that describe the motion of the particle. This 
means that the quantity k • r that appears in the exponential in the 
integral of (35.17) is small in comparison with unity wherever u„ and 
u k are large enough to give an appreciable contribution to the integral. 
A good approximation is then obtained by replacing exp ik • r by 1. 

The resulting integral can be simplified by expressing it as a matrix 
element of the momentum of the particle 


j u k grad A u n dr = | j u k p A u n dr = | (p A )* n 

where p A is the component of the particle momentum p along the direc¬ 
tion of polarization of the incident radiation. The matrix theory of Sec. 

1 In using Eq. (22.10), it must be remembered that the operator i grad, not the 
operator grad, is Hermitian. 
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23 shows that the momentum matrix of the unperturbed particle is given 
by p = m(dx/dt). Thus from Eq. (23.27) 

l d i 

“ (p)fcn = ^ (r)*n — ^ (-®Aj -®n)(r)*n = ^An(r)fcn 

The integral in Eq. (35.17) becomes, in this approximation, 

/ Wl YYl f 

u k grad A u n dr = - j co*,^)** = ~ j <*kn / u k r A u n dT (35.20) 

where r^ is the component of r along the direction of polarization. Equa¬ 
tion (35.20) can, of course, also be derived without recourse to matrix 
methods (see Prob. 3). 

Transitions for which the probability can be computed by substitu¬ 
tion of (35.20) into (35.17) are called electric-dipole transitions , since only 
the matrix element of the electric-dipole moment ex of the particle is 
involved. 1 The transition probabilities per unit time for absorption and 
induced emission then become, in the dipole approximation, 

4-V 2 * 4 

^IM(r A ) kn \|» (35.21) 

It is convenient to denote by (r) ftn the vector whose cartesian components 
are the kn matrix elements of x } y, and 2, and to put 

|(r)*n| 2 = (r)jtn'Tr5*n (35.22) 

which is the scalar product of (r)* n and its complex conjugate. The 
reason for doing this is that there are usually pairs of states k and n for 
which |(r)* n | 2 is the same, but for which the vector (r)* n has various orien¬ 
tations in space. 2 Then if 0 is the angle between (r)* n and the direction 
of polarization of the incident radiation, |(r)*„| 2 cos 2 0 can be substituted 
for |(?m),ii] 2 hi Eq. (35.21), and an average performed over 0. The 
average of (35.21) for such pairs of states is then 

SS (35.23) 

Forbidden Transitions. It may happen that the dipole matrix ele¬ 
ment (r ) kn is zero for particular states k and n. In that case the approxi- 

1 The quantity n iN the electric-dipole moment of the particle of charge c with 

respect to an arbitrarily placed origin; the addition of a constant vector (corresponding 

to a shift in the origin) to r docs not affect the matrix element (35.20), since w* and v n 
are orthogonal. 

4 For example, if the partiele moves in a spherically symmetric potential F(r), tho 
state k can have l = 0, and the states n ean have l = 1 and three values (0, ±1) for 
the magnetic quantum number vi. 
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mate replacement of exp tk • r by 1 in the integral of (35.17) is not 
justified. The exponential can be expanded in a power series 

exp tk • r = 1 + ik - r + i (ik * r) 2 + - * - 

or in a series of spherical harmonics like (19.9) 

exp ik • r = jo(kr) + 3#i(/rr)Pi(cos 0) — 5j 2 (/rr)P 2 (cos $)+••• 

where 0 is the angle between k and r. The second series is more con¬ 
venient than the first if, as is usually the case, the wave functions u k 
and u n can be expressed in terms of spherical harmonics. 

With either series, the dominant factor in the nth term is proportional 
to (kr) n if kr 1 [see the first of Eqs. (15.7)]. Thus if the dipole matrix 
element vanishes but the next term of each series does not, the transition 
matrix element is reduced by a factor that has the order of magnitude ka, 
where the linear dimensions of the particle wave functions are of order a. 
A transition of this type is called a forbidden transition , since its prob¬ 
ability is reduced by a factor ( ka ) 2 with respect to dipole or allowed 
transitions , and usually ka <3C 1. Successive terms in the series can be 
interpreted in terms of electric-dipole, -quadrupole, etc., transitions, and 
involve successively higher powers of ka. 

If both the states u k and u n are spherically symmetric, the integral 
JtZ*(exp tk • r) grad 4 u n dr is identically zero. This can be seen by choos¬ 
ing cartesian coordinates for performing the integration such that the x 
axis is along the direction of polarization. Then gradx u n is an odd 
function of x, whereas u k is an even function of x , and exp ik • r = 
is also even in x, since the vector k is perpendicular to the direction of 
polarization and hence is in the yz plane. Thus the integrand is an odd 
function of x and the integral in (35.17) vanishes. The transition 
between these states is said to be strictly forbidden , since the first-order 
probability given by (35.17) is zero. It is still passible for transitions to 
be produced by higher orders of the perturbation IF given in Eq. (35.13); 
in such a calculation, the previously neglected term e 2 A 2 /2me 2 must be 
included in H '. However, it can be shown with the help of quantum 
electrodynamics that such higher order transitions involve more than 
one quantum, and hence are not simple emission or absorption processes 
in which the quantum energy is equal to the energy difference between the 
unperturbed states k and n of the particle. 


36. SPONTANEOUS EMISSION 

A classical charged oscillator can absorb energy from a radiation field 
or give energy up to it, depending on the phase relation between the field 
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and the oscillator. These effects are analogous to the absorption and 
induced emission of the last section. A classical oscillator also emits 
radiation spontaneously, whether or not an external radiation field is 
present. In this section, we calculate the electromagnetic radiation 
from a classical oscillating charge-current distribution in the absence of 
external fields, and somewhat arbitrarily rewrite the formulas in terms of 
quantum matrix elements to obtain a probability for spontaneous emis¬ 
sion. The results are then verified by comparison with Planck's expres¬ 
sion for the distribution of thermal radiation in a cavity. 

Classical Radiation Field. A charge-current distribution can be 
completely specified by the current density J, since J determines the 
charge density p through the equation of continuity (35.3). In similar 
fashion, the electromagnetic fields in empty space, away from charges 
and currents, can be completely specified by either E or H, because of the 
connections (35.2) between them. Now a wave equation for H is 
readily obtained by taking the curl of the first of Eqs. (35.9). 

(36.1) 

Thus H can be obtained in terms of J alone, whereas the similar equation 
for E involves both J and p (although p can, of course, be eliminated). 
We proceed by solving Eq. (36.1) for H when the three cartesian com¬ 
ponents of J vary harmonically in time with the same angular frequency 
a but not necessarily with the same phase: 

) = 2|J.(r)| cos (c d - y x ) = ,/ x (r)(!-<“' + c.c. 

Ur) = \J.{t)\*» ' ‘ j 

with similar expressions for the y and z components. We are interested 
only in the steady-state solutions for E and H that have the same fre¬ 
quency (a 

Ex(r,/) = 2|Ex(r)| cos {at - fc) = E,(r)c-*“ + c.c. 

H*(r,<) = 2|Hx(r)| cos {at - f*) = H*(r)(r , “‘ + c.c. (36.3) 
E*(r) = |Ex(r)|c* t ', Hx(r) = |H.(r)|c«- 

again with similar expressions for the y and z components. From the 
second of Eqs. (35.2), E is given in terms of H in empty space by 

E(r) = curl H(r) (36.4) 

0) 

With substitutions from (36.2) and (36.3), Eq. (36.1) becomes 

(V 2 + fr s )H(r) = - ~ curl J(r), k = " (36.5) 
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of tt0 type » 6 >: ite ■<■>«*» 
expressed in terms of the Green's function (26.15), and is 

lj/-\ _ 1 f curl J(r') r 

H c J “|r - r'| [exp (*l r - r '| )W (36.6) 

^aToutSw 8 the r“ SOlUti ° n ° f (36 - 6) ’ for large this solution 
, i .° ut f ^ g wave that vanes with r and t like o + c c so 

taLf by S ° Urrial * element a‘ a later ttaekud 

ence is retarded with respect to the current element. 

mom2T, Pt0tiC ^ are “ terested in ^e energy and angular 
77 y the field - As Sh0wn below - the energy can 
field at verv lT 6 temS m tbe asym Pt°tic expression for the 

J?^adStlon si m f/' t r y “ Vr; the an ^ lar momentum requires 

? 1/K The ^pendent part of the 
oegrand of (36.6) can be expanded in powers of 1/r 

ex p (Ojr - r' | ) - 1 / r' C0B 6 + fcfrr'* sin 2 d\ , 

V~ * | r- - r V 1 d r - ) w (36.7) 

where 6 is the angle between r' and r. Substitution of (36 7) into (36 6) 

& 5 M j -— 

ponent E F & P6n ° d ° f the osclIlatlon has as a typical com- 


P '( r ) T l|E»(r)| |H„(r)|[cos (at - &) cos (at - f„)] timoavii . 

c ~ ^ r )l |Hm( r )|[co3 (at - £„) cos (wi - £,)]««.»„.} 

= s ^1 IH*(r)| COS (*. - f.) - | E „(r)| |H,(r)| cos ({, - f.)} 

This and the other two components can be put in the form 


P( f ) = £ AKECr) X HIT)] (36.8) 

^StfrestS^vi'n tho T* ° f ? he , expression tha t follows. Now we 

■as: 
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from the center of the charge-current distribution to the point at which 
the field is measured. Equations (36.4), (36.6), and (36.7) then give, 
to order 1/r (where now r = z), 


■ j ik 

H * ~ n e 

ikr J 


J J^er^dr' 

H.-*0 


E. 

rc j 

f J x (,P)e~ ik ^dr' 

r ik 

J 

f J v {t')e~ iM dr' 

L -+0 



(36.9) 


Partial integration has been used to get rid of the derivatives of J in 
the integrand of H. Equations (36.9) show that the asymptotic fields 
are transverse to the direction of propagation. They also relate the 
polarization of the emitted radiation to the current distribution, and 
show that only the component of the current perpendicular to the direc¬ 
tion of propagation contributes to the radiated energy. Substitution 
into (36.8) gives 

P' = 2^3 (1/ + || JyC-^dT'^j (36.10) 

Equation (36.10) can be generalized to give the average energy flux in 
the direction of a vector k 


J 2 

2nrV 


/ 


•f±k(r')[fcxp ( —* * r')]dr' 


(36.11) 


where J ±k is the component of J perpendicular to k. 

Dipole Radiation. Equation (36.11) is an exact expression for the 
energy radiated by the classical current distribution (36.2). As in Sec. 
35, the electric-dipole approximation is obtained in the long-wave-length 
limit by assuming that hr* <K 1 and replacing exp ( — ik • r') by 1 in the 
integrand. The energy flux is then 


F 

%rr l c 


I Jx k(r')rf/ 


(36.12) 


From Eqs. (36.9) with the same approximation, it is apparent that the 
polarization (direction of the electric field) of the radiation is determined 
by the total current vector J 0 = JJ(r')dr'. The radiation is linearly 
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polarized if J 0 has only one component in the plane perpendicular to the 
direction of propagation, circularly polarized if Jo has two equal com¬ 
ponents in this plane that are perpendicular to each other and 90° out of 
phase (so that one component is i times the other), etc. 

The angular distribution of the radiation can be found by replacing 
|J/ik(r')dr , | 2 in (36.12) by (J 0 - Jo) sin 2 6 = |J 0 | 2 sin 2 6 , where 8 is the 
angle between J 0 and k, and |Jo| 2 is an abbreviation for the scalar product 
of Jo and its complex conjugate. The total power radiated is then the 
integral of (36.12) over the surface of a sphere of radius r 

^ |Jo| 2 (B6.13) 

Angular Momentum. The angular momentum radiated per unit 
time is equal to the torque exerted on a large, perfectly absorbing sphere 
that is centered at the charge-current distribution that constitutes the 
source of radiation. The average energy flux is P, so that the (directed) 
energy density is (l/c)P and the momentum density is (l/c 2 )P. Since 
the radiation travels outward with speed c, the torque exerted on a 
perfectly absorbing differential element of area, dA , that is perpendicular 
to r is cdA times the vector product of r and the momentum density: 
(dA/c )(r X P). Integration of this quantity over the sphere of radius 
r gives the angular momentum radiated by the source per unit time. 
Thus only the components of P tangential to the sphere are involved: in 
the notation of Eqs. (36.9), these are P m and P v , since the z axis is along r. 

If Ef and H* were zero, the tangential components P x and P v would 
also be zero, and no angular momentum would be radiated. The third 
and sixth of Eqs. (36.9) imply only that the z (radial) components of the 
field are of smaller order than 1/r; actually they are of order 1/r 2 . This 
means that P x and P v fall off as 1/r 3 for large r. Then since r X P appears 
in the expression for the angular momentum, and the area of the absorb¬ 
ing sphere is proportional to r 2 , the total angular momentum absorbed by 
a large sphere is independent of r. 

We require the 1/r 2 terms in E, and H«, but not in the other field 
components. 


H» ~ e ' kr f [y’J'i r') - x'J v (t')]e-*-'dT > 

E. —* ^ e !kr J [2J,(r0 + ihx'J x (r') + iky'J^r'^e^dr' 


(36.14) 


Equations (36.9) and (36.14) are sufficient for an exact calculation of the 
radiated angular momentum. 
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Dipole Case. The expressions for P x and P y are simplified by the 
dipole approximation since only the terms of lowest order in kr f need be 
retained in (30.9) and (36.14). It is easy to see that threading term in 
P aj for example, is — (c/2Tr)Re(E a \-\ v ) and not (c/27r)i?e(E v H«). We obtain 
to lowest order in for* 


;««(*' /-W / J ^) 
P,-^Re(i j f -7./J/) 


(36.15) 


Equations (36.15) refer to axes fixed with respect to the element dA 
of absorbing area at r. They must now be rewritten in terms of general 



Fig. 29. Relation botwoon the unprimod coordinate ayatom of Ecjh, (39.15) and the pnmod 
coordinate ayntem of Eq. (36.16). 

cartesian coordinates, in order that the angular-momentum component 
about a particular axis fixed in space can be found. This is analogous to 
the rewriting of the energy-flux expression (36.10) in the general form 
(36.11), but is somewhat more complicated. To accomplish this, we 
choose new cartesian coordinates x',y f } z ' that are fixed in space. With 
respect to these, the orientation of the old axes depends on r in the follow¬ 
ing way (see Fig, 29): the z axis is in the direction of r and has the polar 
angles 0,<£ with respect to the new axes, the y axis is perpendicular to t 
and in the plane of r and z f , and the x axis is perpendicular to the plane of 
r and 2 ; . If now we wish to calculate the contribution to the z f component 
of angular momentum from absorption by the element of area 

dA = f 1 sin 9d0d<j> 
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at r, we need only 1 P a given in (36.15). This can be rewritten in terms of 
the components of the total current vector J 0 along the new axes as 

k 

P 9 = —g- Re[i(J sin 0 cos cj> + Joy' sin 9 sin </> + Jo*' cos 6) 

7T T C 

’ ( J 0 y 1 COS (f) Jo*' sin <£)] (36.16) 

The moment arm about the z' axis associated with P x is r sin 0, so that the 
differential element of angular-momentum component is 

dMj = - r sin 0 P x r 2 sin Bddd<t> (36.17) 

c 

Substitution of (36.16) into (36.17) and integration over the polar angles 
gives 

M; = ~ - J 0y j 0x ,) (36.18) 

It is apparent from Eq. (36.18) that the radiation of a particular 
component of angular momentum depends only on the perpendicular 
components of Jo. Moreover, there must be two such perpendicular 
components that are out of phase with each other, for if Jo*' and Jo U ' are 
both real or have the same phase, the parenthesis in (36.18) is zero. Thus 
a linear dipole (Jo entirely in one direction) radiates no angular momen¬ 
tum. The maximum angular momentum for a given value of |Jo| 2 is 
radiated when Jo has two components that are 90° out of phase with each 
other and the third component is zero. If the nonvanishing components 
are along x f and y* we can put Jo*,' = iJo Eq. (36.18) then becomes 

M.' = ^= % Uol 2 (36.19) 

and the other two components of M are zero. 

Comparison of Eqs. (36.13) and (36.19) shows that the maximum 
angular momentum radiated per unit time by an oscillating electric 
dipole is 1/kc = 1/cc times the energy radiated per unit time. If this 
relation is taken over into quantum theory, it shows that a quantum of 
energy, flu, radiated by an electric dipole carries with it an amount of 
angular momentum that does not exceed h . 

Conversion from Classical to Quantum Theory. We now convert 
the classical expression (36.13) for the power radiated by an electric 
dipole, to quantum theory. This requires that we find a quantum 
analogue for the total current vector Jo and associate the radiated power 
with a transition probability between states of the particle that is doing 
the radiating. 
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We want to replace J by a current density that is associated with an 
initial upper state Uk and a final lower state u ni since energy is radiated 
during the transition from Jc to n. It is natural to represent the current 
density as the product of a charge density and a velocity, and to take for 
the velocity the momentum operator divided by the mass: — ( ih/m ) grad. 
The charge density for a stationary state is expected to be the charge of 
the particle times its position probability density: e\\p\ 2 . However, we 
are concerned here with a transition between states, so we replace this by 
eu n Uk. The way in which the grad that appears in the velocity operates 
on the wave functions that appear in the charge density is determined by 
arguments of the type presented in Sec. 7 [see Eq. (7.3)]. We thus arrive 
at a quantity to substitute for the classical current density: 

J(r) —> - ^w„(r) grad u*(r) (36.20) 


We assume that (36.20) can be substituted into all the foregoing classical 
expressions to give quantum results. 1 

Integration of (36.20) over the coordinates gives the total current 
vector 

J° = ~~ j Un grad Ul ^ T = ~ iec * nh J '^TUicdr = icoj A „(r)ir (36.21) 

with the help of equation (35.20). Substitution of (36.21) into (36.13) 
then gives the radiated power. We interpret this power as the product 
of the spontaneous rate of transition from k to n, and the quantum energy 
hwkn = h(E k — E n ) given off in each transition. The transition prob¬ 
ability per unit time for spontaneous emission then becomes 


Aprk 1 CO kn ./ v 12 _ l/ s 1 2 

Me |( ” Stir* l( 


(36.22) 


where use has been made of the relation = At. 

Planck Distribution Formula. The transition from the classical 
expression (36.13) to the quantum expression (36.22) can claim only a 
moderate amount of plausibility. The correctness of the latter result can 
however be verified by showing that Planck’s formula for the spectral 
distribution of thermal radiation in a cavity follows from (36.22) and 
(35.23). This is the way in which the relation between the probabilities 


1 The exponential that appears in (36.11), for example, can bn planed either before 
or after the grad operator, ninno only the component of grad that is perpendicular to 
k enters in, 
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for absorption, induced emission, and spontaneous emission was first 
obtained. 1 

We assume that the walls of the cavity contain particles of charge e 
and mass m, each of which is bound by a potential V of the type that 
appears in (35.1). When these particles are in equilibrium with thermal 
radiation at the absolute temperature T, there must be as many quanta 
of each frequency emitted as absorbed per unit time. The rate of 
emission of quanta of frequency is the sum of (35.23) and (36.22), 
multiplied by the number of particles that are in the upper state /f. The 
rate of absorption of these quanta is the product of (35.23) and the 
number of particles in the lower state n. Now from statistical mechanics, 3 
the equilibrium ratio of the number of particles in the upper state to the 
number in the lower state is given by er(**-*iO/«r w here k. is Boltzmann's 
constant. We thus obtain, on dropping the subscripts from co**, 

[to 'MK'W + TO IW-I’] ” TO 7WI(r ‘" ,l ‘ 

This is readily solved for I (&>) to give 


/(«) = 


ft to 

ir 2 c 2 (e* r — 1) 


(36.23) 


It is interesting to note that the parameters e, m, and (t)kn of the particle 
that emits and absorbs the radiation drop out of the expression for I(os). 
The agreement between Eq. (36.23) and the Planck distribution formula 
provides a verification of the ratio of (35.23) to (36.22), and hence shows 
that the latter expression is correct if the former is. 

Line Breadth, A classical oscillator that radiates electromagnetic 
waves loses energy, so that the amplitude of its oscillation decreases in 
time. Thus the electromagnetic fields given off by it have a damped 
sinusoidal time dependence: e~* yt cos (wot + a). The Fourier analysis of 
these fields gives the frequency spectrum of the radiation from the oscil¬ 
lator. The radiated intensity per unit frequency range at the angular 
frequency w is proportional to 


1 

(to — coo) 2 + ^ 7 2 


(36.24) 


1 A. Eins tein, Phys. Zeiis. f 18 , 121 (1917). Einstein's A coefficient is just (36.22), 
and his B coefficient is (35.23) divided by the energy density of radiation I (ajh») /c (this 
is expressed in terms of the angular frequency toim rather than the circular frequency 

«*n/2jr). 

2 See, for example, R. C. Tolman, “The Principles of Statistical Mechanics ” Chap. 

IV (Oxford, New York, 1938). 
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According to (36.24), the intensity of the emitted spectral line has half 
its maximum value when o> = w 0 ± iy. The quantity y is called the 
natural line breadth , and in cases of practical interest is small in com¬ 
parison with coo. 

The line breadth is evidently half the initial fractional rate of decrease 
of the amplitude of the classical oscillator, or is equal to the initial frac¬ 
tional rate of decrease of the oscillator’s energy. It is plausible to associ¬ 
ate the rate of decrease of the energy of the classical oscillator with the 
rate of decrease of the probability of finding the corresponding quantum 
system in its initial upper state. If this is done, the quantum analogue of 
the classical natural line breadth y is the initial transition probability per 
unit time for spontaneous emission given in (36.22). 1 

The foregoing relation between transition probability and line breadth 
can be arrived at in a qualitative but more general way by means of the 
uncertainty relation (3.3). The reciprocal of the transition probability 
per unit time is of the order of magnitude of the time that the quantum 
system stays in its upper state. Thus a determination of the energy of 
the upper state cannot occupy a time that is of larger order of magnitude 
than the lifetime 1/y of this state. According to (3.3), this means that 
the energy cannot be determined with an accuracy much greater than 
h divided by the lifetime, or hy. If the energy of the upper state is uncer¬ 
tain by this amount, the frequency of the emitted line will be uncertain 
(broadened) by 7 . In general, a quantum energy level is broadened by 
any process that shortens its lifetime: the level is perfectly sharp only 
if the lifetime of the state is infinite (true stationary energy eigenfunction). 

A qualitative idea of the natural breadth of electric-dipole lines 
emitted by a quantum system can be obtained by rewriting the expres¬ 
sion (36.22) for 7 in the form 


7 




The factor e*/hc is a dimensionless constant that is very nearly equal to 
tJt if e is the electronic charge , 2 and the factor /f a |(r)jb W | a has already been 
assumed small in comparison with unity in arriving at the dipole approxi¬ 
mation. Thus the ratio of line breadth to angular frequency is expected 
to be quite Bmall (it is of the order of 10 “° for typical atomic dipole lines). 


37. SOME APPLICATIONS OF RADIATION THEORY 
The semiclassical radiation theory developed earlier in this chapter is 
applied in this section to the determination of the conditions for allowed 

1 For further discussion of line breadth, see Hcitlcr, op. cit., See. 12. 

2 This is the fine-structure constant that appears in the theory of the fine structure 
of atomic energy levels (see Chap. XII). 
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transitions, and to the theory of the Cerenkov effect. The latter problem 
is of some practical interest, and also shows how one can calculate the 
radiation from a current distribution that does not depend harmonically 
on the time. 

Selection Rules for a Single Particle. The discussion of forbidden 
transitions at the end of Sec. 35 shows that the probabilities for absorption 
and induced emission are reduced by a factor of at least ( ka) 2 with respect 
to allowed transitions if the dipole matrix element (r)* n vanishes. The 
same remarks apply to the probability for spontaneous emission, since the 
integral in (36.11) is the same as that in (35.19) when the substitution 
(36.20) is made for J. 

The conditions on u * and u n for which the dipole matrix element is 
different from zero constitute the selection rules . They are easily formu¬ 
lated if the potential V that appears in the unperturbed Hamiltonian of 
(35.13) is spherically symmetric. It is shown in Sec. 14 that the energy 
eigenfunctions can then be written as products of functions of the radial 
distance r and spherical harmonics 7^(0,<£) defined in Eq. (14.16). 
The matrix element (r) An is the vector whose cartesian components are 
the corresponding matrix elements of x, y , and z. The matrix element of 
z is Ju*rcos 6u n dr f which can be written as a product of an integral over 
r and the angular integral 


/; f 0 2 * Y lm (0,<f> ) cos e Y iw (8,<t>) sin QdQd<t> (37.1) 

where the primed and unprimed subscripts are the angular-momentum 
quantum numbers for the lower state u n and the upper state Uk } respectively. 

The <(> integration in (37.1) is *<!<$>, which is zero unless 

m = m. The integration can then be written, apart from numerical 
factors, 

wPf(w)Pf.(w)dw, w = cos 0 (37.2) 

Now it can be shown with the help of the generating function (14.13) for 
the associated Legendre functions that 


wPf(w) = 


l + 


21 + 1 


^Pr_ 1 (^) + Z ~ H + 1 


2l + i~ p T+ i( w ) 


Substitution of this into (37.2) shows, with the help of the orthogonality 
integral (14.15), that the matrix element of z vanishes unless Tti r — m 
and V = l ± 1. A similar treatment shows that the matrix element of 


x + iy vanishes unless m’ = m - 1 and V = l ± 1, and the matrix ele¬ 
ment of x — iy vanishes unless m' = m + 1 and V - l ± 1. These 
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selection rules determine the possible allowed (electric-dipole) transitions 
of a single charged particle that moves in a central force field. 

Polarization of Emitted Radiation. The discussion following Eq. 
(36.12) shows that the polarization of the emitted radiation is determined 
by the total current vector J 0j and hence (because of 36.21) by the dipole 
matrix element. When the initial and final states have l values that differ 
by unity and the same magnetic quantum number m with respect to the 
z axis, only the matrix element of z fails to vanish. The radiation is then 
linearly polarized along the z axis if viewed in the xy plane, and there is 
no radiation along the z axis. When the magnetic quantum numbers of 
the initial and final states differ by unity, the x and y components of the 
dipole matrix element are 90° out of phase, and the z component vanishes; 
the radiation is then circularly polarized if viewed along the z axis, and 
linearly polarized perpendicular to the z axis if viewed in the xy plane. 
These results are of interest in connection with the polarization of the 
radiation from atoms placed in a magnetic field (see the discussion of the 
Zeeman effect in Sec. 39). 

Conservation of Angular Momentum. The discussion of Eqs. (36.18) 
and (36.19) shows that the angular momentum that is carried away by 
an emitted quantum has its maximum value ft, and is directed along the z 
axis, when Jn y = iJ Qx . From (36.21) we see that this is the case in which 
(y)kn = i{x) kn , or ( y)kn = —i(z)kn- Now X = r sin 8 cos $ = -Jr sin d 
* (e f + + <r _, '*) J and y = r sin 6 sin <£ = —\ir sin d(e L * — Then it is 

apparent from the <j> integration in (37.1) that in order for the matrix 
element of y to equal —i times the matrix of x, the magnetic quantum 
number of the initial state u Ia must be greater than the magnetic quantum 
number of the final state u n by unity. Application of Eq. (14.23) shows 
that the z component of angular momentum of the particle has decreased 
by ft during the transition. Thus angular momentum is conserved between 
the radiating particle and the emitted quantum. 

The foregoing result is based on the connection between the classical 
and quantum current densities assumed in (36.20). The successful 
derivation of the Planck distribution law in Sec. 30 shows that. (36.20) 
is correct so far as magnitude is concerned. The above demonstration 
of the conservation of angular momentum shows in addition that the 
phases of the initial and final states are inserted properly in (36.20); if, 
for example, J had been assumed proportional to Uk grad u n , an incon¬ 
sistent result would have been obtained in the last paragraph. 

If the magnetic quantum number docs not change in a transition, only 
the matrix element of z fails to vanish, and the discussion of (36.18) shows 
that the quantum carries off no angular momentum. This might seem 
at first to be in contradiction with the change by one unit of the orbital- 
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angular-momentum quantum number l . The x and y components of the 
angular momentum of the particle do not commute with the z component 
(which in this case is known to be mh and does not change), so that they 
cannot be precisely specified. Their expectation or average values for 
states that are described by quantum numbers l and m are zero, since the 
diagonal elements of the matrices for the x and y components of the angu¬ 
lar momentum are all zero [see Eqs. (24.16)]. Thus there is no observable 
change in any of the components of the particle's angular momentum, 
and so the expectation value of the angular momentum carried off by 
the quantum should be zero. The x and y angular-momentum com¬ 
ponents for a particle in a stationary state can be thought of as fluctuat¬ 
ing about zero in such a way that their average values are zero, although 
their average squares are not. The change in l corresponds to changes in 
these average squares. 

Selection Rules for Many-particle Systems. When a quantum- 
mechanical system consists of several particles that do not interact with 
each other, the total Hamiltonian is simply a sum of terms like Hq + H f 
that appear in (35.13). The unperturbed energy eigenfunctions are 
products of single-particle eigenfunctions such as are discussed in Sec. 
32 (they can be unsymmetrized if the particles are not identical). It 
is clear that the matrix element that appears in the first-order perturba¬ 
tion theory of Sec. 36 (absorption and induced emission) involves a 
multiple integral of the form 

J ■ ■ • Rf(l)*<2) 

• ■ • [JT'(l) + JT'(2) + ■ • • ]u a (l)u b (2) • ‘ • drdrt ■ * * 

Because of the orthogonality of different u } s for the same particle, this 
integral vanishes unless all the single-particle functions . . . are 
equal to the corresponding functions u a , . . . , except for one. Thus 
only one of the particles can change its state in a transition, and the 
selection rules for a central force field are precisely those given above. 
Since the spontaneous transition probability can be related to the same 
integral through the Planck distribution formula, these selection rules 
hold for spontaneous emission as well as for absorption and induced 
emission. 

If the system consists of several charged particles whose mutual 
interactions cannot be neglected, we must base the selection rules on 
general conservation laws for total angular momentum and parity. It is 
not difficult to generalize the work of Secs. 35 and 36 to show that the 
dominant term, when the wave length of the radiation is large in com¬ 
parison with the dimensions of the system, is the matrix element of the 
total electric-dipole moment e^i + e 2 r 2 + • • • . The angular momen- 
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turn radiated by an oscillating dipole cannot, according to the discussion 
of (36.19), exceed h per quantum; this classical argument is based on an 
arbitrary current distribution, so that it is not invalidated if several 
particles contribute to the dipole. The interpretation of the conservation 
of angular momentum between the emitted quantum and the radiat¬ 
ing system is complicated by the semiclassical nature of the entire treat¬ 
ment. A consistent treatment based on quantum electrodynamics shows 
that the correct selection rule based on this conservation law is that the 
total-angular-momentum quantum number of the system can remain 
unchanged, or increase or decrease by unity. An exceptional case is 
that in which this quantum number is zero for both initial and final states. 
The initial and final wave functions are then spherically symmetric, and 
an extension of the argument given at the end of Sec. 35 shows that a 
radiative transition between these states is not only forbidden (no allowed 
dipole transition), but strictly forbidden (no first-order transition 
whatever). 

The discussion of parity in connection with Eq. (23.26) shows that the 
parity of each energy eigenfunction can be well defined (even or odd) if 
the total Hamiltonian is unchanged by reflection of the coordinates of all 
particles through the origin. All known Hamiltonians are of this type. 
Now the electric-dipole-moment operator given above is evidently odd 
with respect to reflection of all coordinates; thus its matrix elements van¬ 
ish unless the initial and final states have opposite parities. This selec¬ 
tion rule is known as the Laporte rule. 

Cerenkov Effect. It is not difficult to show that a charged particle 
that moves freely in empty space does not radiate energy. Electro¬ 
magnetic radiation has the greatest momentum for a given amount of 
energy when it is all flowing in one direction, in which case the ratio of 
momentum to energy is 1/c. The relativistic relation between the total 
energy E (including rest energy) and the magnitude p of the momentum 
of a freely moving particle is E 2 = p 2 c 2 + m 2 c 4 , where m is the rest mass. 1 
Differentiation of this relation gives 2 EdE = 2 pcHp> so that the ratio of 
momentum loss to energy loss of the particle if it radiates is 

dp = jL = i r, , /wyr 

dE pc 1 c L \ V) J 

which is always greater than 1/c. Thus it is impossible for a freely 
moving particle to radiate, since if energy were conserved between the 
particle and the electromagnetic field, momentum would not be. 

1 See, for example, R G. Bergniann, “ Introduction to the Theory of Relativity,” 
Chap. VI (Prentice-Hall, New York, lt)42). 



262 


QUANTUM MECHANICS 


[Chap. X 


The situation is altered if other matter is present to take up the excess 
momentum. An electron bound in an atom or an electron that passes 
near an atomic nucleus can radiate; the latter case is usually described in 
terms of the deflection of the electron by the nucleus and the emission of 
radiation by the accelerated electron. An interesting example in which 
the presence of matter makes it possible for the conservation laws to be 
satisfied is provided by the uniform motion of a charged particle through 
a medium of refractive index n, when the speed v of the particle is greater 
than the speed of light in the medium: 

» > £ (37.3) 

n 

The radiation from fast electrons that satisfy (37.3) has been observed 
by Cerenkov. 1 The theory of the Cerenkov effect 2 is developed on the 
basis of classical radiation theory in the remainder of this section. This 
treatment is satisfactory in so far as the structure of the medium can be 
represented by a refractive index. 

Expression for the Current Density. We assume that the moving 
charge has dimensions that are small in comparison with emitted wave 
lengths of interest. Then if the particle has charge e, moves in the z 
direction with the speed v , and is at the origin at t = 0, the current density 
can be written 

J X {tyt) = = 0 4") 

Jj(r y t) = ev8(x)8(y)8(z — vt) 

The Dirac 8 function is that introduced in Sec. 11. Since the radiation 
theory developed in Sec. 36 is based on a current density that depends 
harmonically on the time, must be expressed in terms of a Fourier 
integral 

- f Q °°[J M>(r)e~*"' + c.c.]d« (37.5) 

The Fourier amplitude of angular frequency w is given by 

J za.(r) = £ 8(x)S(y)eT (37.6) 

This can be verified by substituting (37.6) into (37.5) to obtain 
J.(r,0 = £ 5(x)S(y) jj + ^(H]^ 

= £ Kx)5(y) J e ‘“G _ 0(f w 

1 P. A. Cerenkov, Phys. Rev., 52, 378 (1937). 

* I. Frank and I. Tamm, Comptes Rendua de VAcod, Sci. U.R.S.S. , 14, 100 (1037). 
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which with the help of (11.10) and the fifth of Eqs. (11.13) is seen to 
agree with (37.4). 

Fourier Analysis of the Radiation Field. The electromagnetic fields 
can be expressed in analogy with (37.5): 

EM = f [E u (r)e-'“* + c.c.tfa, 

7. (37.7) 

H(r } t) = j Q [H w (r)6" tui + c.c .]da> 

Because of the linear dependence of E and H on J through Maxwell’s 
equations, each of the Fourier components E« and H M can be obtained 
entirely from the corresponding component J Z(a of the current density. 
The Poynting vector obtained from (37.7) is 

M x HM = ~ fj [EM x 

+ E w (r) X + E u (r) X 

+ E^r) X H</r )e^'»\do>dd (37.8) 

Equation (37.8) can be used to find the spectral energy distribution 
of the radiation. The distribution of the radiated energy in frequency is 
usually of more interest than the distribution in time, if the emission 
process occupies a time that is short by laboratory standards. We there¬ 
fore integrate (37.8) over t from — qo to + °o to obtain the total flow of 
energy per unit area. Since t appears only in the exponentials, we can 
use Eq. (11.10) to obtain 5 functions in the frequencies. Then when the 
integration over «' is carried through, the first two terms in the integrand 
of (37.8) contribute nothing, and the last two terms yield 

b E(r ’*> x H( - T ' t)dt 

= f Q [EM x H„(r) + EM x HM]d <0 (37.9) 

The integrand of (37.9) is just twice the real part of E w (r) X H u (r), and 
in accordance with (30.8) can be called (^AOP^r). Since (37.9) has 
the form of an integral over frequency, we see that 27rP 0 ,(r)dw is the flow 
per unit area of energy that lies in the angular-frequency range o> to 
co + dto. 

We assume that the medium with which we are concerned is a homo¬ 
geneous isotropic dielectric that is characterized by a dielectric constant 
or specific inductive capacity e. This means that c is to be replaced by 
c/e* = c/n in Secs. 35 and 36, where the refractive index n of the medium 
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is equal to e*; also, k is replaced by nk. Thus (36.11) becomes, in our 
present notation, 

Pt»(x) = ^~ c |/ J xk,«(r') exp (-ink • r')dr' 

Here P**, is the component of P w in the direction of observation (parallel 
to k or r), and the magnitude of k has the earlier value co/c. 

Radiated Energy. The direction of observation is taken to have 
polar angles 0,<£ with respect to the z axis. Then k • r' in (37.10) is equal 
to k(x' sin 0 cos <t> + y' sin 0 sin + z' cos 0), and J sin & 

Substitution of (37.6) into (37.10) then gives for the energy flow per unit 
area and angular frequency 


(37.10) 


ZarPtofr) 2 tt ^ sin 2 0 


m 


S(x')S(y')e 


lOiZ' 

V 


ne 2 w 2 sin 2 0 
4nr 2 c z r 1 


. g—tnACa/ ain ^ ooh ^4l/'nin oob (j[y f 

J exp j iuz' j dz'J (37.11) 


In order that (37.11) be definite, the particle can be considered as radiat¬ 
ing only over a finite length L of its path, so that the integral over z' 
extends, say, from — to ^L. 

The squared integral in (37.11) is easily evaluated to give 


4 sin- [i«L 0 - l^L'V 

2 a _ n cos g y 
" v c ) 


(37.12) 


For large L, this has a sharp maximum when the denominator is zero, at 


cos do — —- (37.13) 

nv 

Thus the radiation appears at a cone in the forward direction whose half 
angle decreases as nv/c decreases. Since 0o becomes imaginary if v < c/n } 
there is no radiation if the condition (37.3) is violated; in particular, there 
is no radiation if n = 1 (empty space), since v is always less then c. The 
fact that (37.12) does not quite vanish for cos 0 less than unity even if 
v < c/n is due to the choice of a finite path length L; the starting and 
stopping of the charge at the ends of its path mean that the particle is 
accelerated, and this makes some radiation possible. 
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The total energy radiated involves an integral of (37.11) over the 
surface of a sphere of radius r. This can be evaluated like the integral in 

(29.11) , by making use of the sharpness of the maximum to extend the 
limits on cos 0 to ± «> and to replace sin 2 6 by sin 2 6 0 obtained from 
(37.13). The integral of (37.12) is 4x 2 cLr 2 /nw. Substitution into 

(37.11) gives for the total energy radiated by the particle per unit angular- 
frequency range in the distance L 

coe 2 L sin 2 0 O ioe 2 L / c 2 \ 

c 2 = V nV/ 

The number of quanta with angular frequency between a> and <a + dco 
emitted per unit distance by a particle of charge e moving with a speed v 
through a dielectric of refractive index n is then 

S? ( l " !S>) “ fir (l - T » 7 ' 14 > 

if 6 is the electronic charge. Thus the number of quanta emitted per 
unit frequency range depends on a only through n. For a very fast elec¬ 
tron (v = c) passing through water (n = 1.33), Eq. (37.14) shows that 
there are about 230 quanta emitted per centimeter of path in the visible 
region (wave lengths from 4,000 to 7,600 A). 

Problems 

1. Show that if J = p « 0, the most general solution of Maxwell's equations can 
be expressed in termB of potentials such that div A «</>*= 0. 

2. Estimate the order of magnitude of eA/cp , when e is the electronic cliargo, 
A is the magnitude of the vector potential for the visible part of the spectrum that 
corresponds to the radiation in a cavity at several thousand degrees centigrade, and 
p is the momentum of an electron in the first excited state of hydrogen. 

3. Verify Eq. (35.20) by means of the wave equation, without recourse to matrix 
methods. 

4. Show that the transition probability for spontaneous emission is equal to the 
transition probability for induced emission that would result from an isotropic field of 
such intensity that there is one quantum per state of the field in the neighborhood of 
the transition frequency. 

5. A hydrogen atom in its first excited (2P) state is placed in a cavity. At what 
temperature of the cavity are the transition probabilities for spontanoous and induced 
emission equal? 

6. What is the spontaneous transition probability per unit time, expressed in 
sec -1 , of a hydrogen atom in its first excited state? 

7. What ih the selection rule for allowed transitions of a linear harmonic oscillator? 
What is tho spontaneous transition probability per unit time,*expressed in sec'" 1 , of an 
oscillator in its first excited state, when c, m t and w are the same as in Prob. 0? 

8. Show that a logarithmic factor like that in Eq. (30.11) always appears in the 
cross section for excitation of a onc-ricctron utorn by electron impact, if tho corre* 



266 


QUANTUM MECHANICS 


[Chap. X 


sponding radiative transition is allowed. Derive the simplest relation that you can 
between the differential cross section for excitation by electron impact and the corre¬ 
sponding transition probability for spontaneous emission, assuming the transition 
is allowed. 

9. Starting from Eq. (37.5), derive a general expression for t/* u (r) in terms of 
assuming that the latter is real. Use this to verify Eq. (37.6) when i/*(r,0 is 
given by (37.4). 

10. What is the polarization of the Cerenkov radiation? Is any angular momen¬ 
tum carried off by the radiation field in this case? 



CHAPTER XI 

ATOMS, MOLECULES, AND ATOMIC NUCLEI 


This chapter is not intended to be a complete survey of the properties 
of atomic systems. It is primarily a presentation of a relatively few 
problems that arise in connection with the structure of matter, selected 
because they supply interesting and instructive applications of quantum 
mechanics. These problems are grouped according to subject; enough 
explanatory material is included so that the treatment is coherent, 
although severely limited in scope. 

38. APPROXIMATIONS IN ATOMIC STRUCTURE 1 

The ground states of the two lightest atoms, hydrogen and helium, were 
considered in Secs. 16 and 27, respectively. Variation calculations simi¬ 
lar to those described for helium have been carried through for others of 
the light atoms. This section describes some of the approximations that 
have been used for the heavier atoms. The alkali atoms are discussed 
separately in Sec. 39. 

Central-field Approximation. The starting point of calculations on all 
except the lightest atoms is the central-jicld approximation. This assumes 
that each of the atomic electrons moves in a spherically symmetric poten¬ 
tial energy V(r) that is produced by the nucleus and all the other elec¬ 
trons. The approximation is a good one if the deviation from the V(r) 
for one electron produced by close passage of other electrons is relatively 
small. This is actually the case, since the constant nuclear potential is 
of the order of Z times as large as the fluctuating potential due to each 
nearby electron, and the latter varies quite slowly (inversely) with the 
separation distance. The two principal problems are then the calcula¬ 
tion of the central field, and the correction of the approximate results 
obtained from it. Before considering these problems, we discuss some 
general properties of the central field. 

The potential energy F(r) for a neutral atom has the Coulomb form 
— e 2 /r at a great distance r from the nucleus, since the removal of the 
electron whose potential is being measured leaves a singly charged positive 

1 For a more detailed dismission of the material of this section and the next, see 
E. U. Condon and 0. H. Shortley, “The Theory of Atomic Spectra 1 ' (Cambridge, 
London, and Macmillan, New York, 1935). 
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ion. The electron in the hydrogen atom, for which the potential energy 
is — e 2 /r at all r , was shown in Sec. 16 to have an infinite number of 
bound energy levels characterized by the quantum numbers n, l f and m. 
An infinite number of energy levels is also expected for 7(r), since for 
large n , the electron wave function iB small near the nucleus, and only the 
form of V(r) for large r is significant. An important difference between 
the two situations is that the degeneracy between states of the same n 
and different l that occurs in hydrogen is removed in a non-Coulomb 
central field. This is because the electrons that have smaller angular 
momentum penetrate closer to the nucleus, and V(r) is stronger (more 
negative) than —e*/r there, since the nucleus is less completely screened 
by the other electrons. Thus for given n, the states of lowest l have the 
lowest energy. The degeneracy with respect to m is not affected, since 
this occurs whenever the potential is spherically symmetric. 

Because of the spin, four quantum numbers n, l, mi, and m a are 
required to specify the state of an electron in a central field. The orbital 
quantum numbers l and m t are the same as l and m in the hydrogen atom, 
m a = specifies the spin orientation, and n is a natural generalization 
of the total quantum number that appears in hydrogen. Equation 
(16.14) shows that n — l — 1 is the number of nodes of the radial part 
of the hydrogen wave function; this definition of n is carried over to the 
general central field, so that l does not exceed n — 1. 

Periodic System of the Elements. According to the Pauli exclusion 
principle (see the discussion of antisymmetric wave functions in Sec. 32), 
not more than one electron in an atom can have a particular set of values 
of the four quantum numbers given above. As Z increases, electrons fill 
the one-electron states of lowest energy in succession; the ground state 
of an atom in the central-field approximation is that in which there are 
no unfilled electron states that have lower energy than any that are 
occupied. Because of the degeneracy with respect to mi and m n , there 
can be as many as 2(2Z + 1) electrons with the same energy in a shell 
that is specified by n and l. It is apparent then that the ground-state 
configuration of the electrons in an atom can be described by specifying 
the number of electrons in each shell. In the central-field approximation, 
all shells that contain any electrons are full except perhaps that which 
has the highest energy. 

The chemical properties of atoms are determined for the most part 
by the least tightly bound, or valence , electrons, which are in the shell of 
highest energy. The most important factors are the number of occupied 
and unoccupied electron states in this shell, and the energy interval 
between this and the next higher (empty) shell. For example, an atom 
tends to be chemically inert if its highest shell is full and there is an 
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appreciable energy gap to the next higher shell, since then electrons are 
not readily shared with other atoms to form a molecule. The quasi- 
peri odic recurrence of similar highest shell structures as Z increases is 
responsible for the periodic system of the chemical elements. 

In the usual spectroscopic notation, the n value of a shell is given as a 
number, the l value as a letter, and the number of electrons in the shell 
as a numerical superscript. The letter code for l, and the maximum num¬ 
ber 2(2 1 + 1) of electrons in a shell, are as follows: 

1 = 0,1, 2, 3, 4, 5, . . . 

S ) P j d, /, g } h f . . . 

2(2Z + 1) = 2, 6, 10, 14, 18, 22, . . . 

For example, the ground-state configurations of sodium (Z — 11) and of 
mercury (Z = 80) are 

Na: ls 2 2s 2 2p fl 3s 

Hg: 1 s 2 2s a 2p 0 3s 2 3pH5 2 3d lo 4p fl 6s 2 4d lo 5p fl 6s 2 4/ i4 5d 10 

The ground-state configurations of many of the elements can be 
written down simply from a knowledge of the order in which the energies 
of the shells increase. This order can be inferred from spectroscopic 
evidence, and is as follows: 

ls,2s,2p,3s,3p,[4s,3d],4p,[5s,4d],5p,[6s,4/,5d],6p,[7a,5/,6d] 

The brackets enclose shells that have so nearly the same energy that they 
are not always filled in sequence. These shell energies are close together 
because the increase in n and the decrease in l tend to compensate each 
other; thus the 4s state, which has a higher energy than the 3d state in 
hydrogen, is depressed by the penetration caused by its low angular 
momentum. The s shell in each bracket is always filled first, although it 
can lose one or both of its electrons as the other shells in the bracket fill 
up. Apart from the brackets, there are no deviations from the indicated 
order of filling. 

Table 2 gives the ground-state configurations of each of the elements. 1 
An atom contains all the full shells that occur above and to the left of its 
position in the table. Since the number of s electrons varies as each d 
shell fills, the d columns are subdivided to show the number of s electrons. 

1 This table is taken from Condon and Shortlcy, op. cii. , p. 333. Recent data and 
estimates on the rare earths, the heaviest natural elements, and the artificial trans- 
uranio elements, neptunium (Np, 93), plutonium (Pu, 94), americium (Am, 95), and 
curium (Cm, 90), arc reviewed by W. F. Meggers, Science , 105, 514 (1947); tho ele¬ 
ment names technetium (Te, 43), astatino (At, 85), and francium (Fa, 87) are as 
given by O. T. Seaborg, Science , 106, 349 (1947). 
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Table 2. Ground-State Electron Configurations of the Elements 
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The two groups of atoms that appear to have a partially full / shell in 
their ground-state configurations fit in at * (rare earths) and at f (heaviest 
elements). The first group has the 6s shell full and the second has the 
7s shell full; the distribution of electrons in d and/ shells for each group is 
shown below the main table. According to Meggers, the entries in the / 
groups that are enclosed in parentheses are extrapolations or predictions 
based on the analyses of neighboring spectra. 

A few of the periodicities are worth explicit mention. The first 
electron to go into each s shell beyond Is gives an alkali, and the elements 
just before each of these (full Is shell or a full p shell) are rare gases. The 
elements with the same number of electrons in a p shell have similar 
chemical properties; this is especially striking in the case of the halogens 
(one electron short of a full p shell). The elements with full 2s and 3s 
shells (Be and Mg) that are followed by p shells have somewhat different 
properties from the alkaline earths, which have full s shells followed by d 
or / shells. The filling of the 4s,3d shells gives elements somewhat similar 
to those arising from the filling of the 5s,4d shells. The elements that 
correspond to full bracketed shells (Zn, Cd, and Hg) are quite similar, as 
are the noble metals (Cu, Ag, and Au) in which an s electron is missing 
from the full bracketed shells. 

Thomas-Fermi Statistical Model. We now turn to the first of the 
problems associated with the central-field approximation. There are two 
methods that have been used for the determination of the potential 
energy 7(r). The first of these, due to Thomas 1 and Fermi, 2 is discussed 
here, and the second, due to Hartree, is taken up later. The Thomas - 
Fermi statistical model assumes that 7(r) varies slowly enough in an 
electron wave length so that many electrons can be localized within a 
volume over which the potential changes by a small fraction of itself. 
The electrons can then be treated by statistical mechanics, and obey the 
Fermi-Dirac statistics mentioned in Sec. 32. At normal temperatures, 
the thermal energy kT is very small in comparison with 7(r) eveiywhere 
except at the edge of the atom, where the chance of finding an electron is 
small. In this case, the Fermi-Dirac statistics requires that the electron 
states fill in order of increasing energy, as assumed above. The difference 
between the present treatment and the more general discussion given 
earlier in this section lies in the additional assumption that 7(r) is 
sensibly constant over a region in which many electrons can be localized. 

The number of electron states in a cube of edge length L at the walls 
of which the wave functions obey periodic boundary conditions waa 
computed in Sec. 11 to be (L/2Tr)hlkjikydk s . This must be multiplied by 

1 L. II. Thomas, Prim. Camb . Phil. Soc., 23, 542 (1927). 

2 E. Fermi, Zv.itn. f. Phyxik, 48, 73 (1928). 
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2 to take account of the two possible spin states; then the number of 
states for which the magnitude of the momentum p = ftk is less than or 
equal to po is 

3 ©77/.X m Bta " Sr- 

If all these states are occupied, the number of electrons per unit volume 
whose kinetic energy does not exceed p\/2m is Pq/37t 2 A 3 . Now the maxi¬ 
mum kinetic energy at any distance r from the nucleus is — V(r) } since 
otherwise electrons would escape from the atom. We thus obtain a rela¬ 
tion between the volume density of electrons, n(r), and the potential 
energy 

»(r) = (3fU) 

The electrostatic potential V(r)/e is also determined by Poisson’s 
equation in terms of the charge density enir) 

7 vay = I ( r! w) = ~ 4ven{r) (38 ' 2) 

Equations (38.1) and (38.2) are two simultaneous equations for n and V. 
The boundary conditions on the solutions can be expressed in terms of V 
alone for a neutral atom of atomic number Z. As r —> 0, the leading term 
in the potential energy must be due to the nucleus, so that V(r ) —» —Ze*/r t 
As r—> there must be no net charge inside the sphere of radius r , 
so that V falls off more rapidly than 1/r, and rV[f) —> 0. The boundary 
condition at infinity is different from that assumed earlier in this section, 
where V was taken to have the asymptotic form — e 2 /r. The V discussed 
earlier is the potential experienced by one of the atomic electrons, while 
the Thomafl-Fermi potential is that experienced by an infinitesimal test 
charge. The difference between the two potentials emphasizes the 
statistical nature of the approximation made by Thomas and Fermi. 
The solution for V is exact in the limit in which m becomes infinite and 
e becomes zero in such a way that m z e 4 remains constant; then the elec¬ 
tron wave length becomes zero, and the density of particles becomes 
infinite. In this limit the potential is constant over many wave lengths, 
and enough particles are present so that statistical mechanics can be 
applied. 

Evaluation of the Potential. Elimination of n(r) from Eqs. (38.1) 
and (38.2) leads to an equation for — V(r) 

l_d_\ a d(-F) 1 4g 2 [—2 mV(x)\i 

r 2 dr l dr J 37 rh z 


(38.3) 
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Equation (38.3) and the boundary conditions given above are conven¬ 
iently expressed in a dimensionless form in which Z, E, m, and h appear 
only in soale factors. We put 


7(r) = -y-x, r = bx 

, = I /^Y K1 = 0-885fflo 
7 2 \ 4 / me 2 Z* Z* 


(38.4) 


where a 0 = h 2 /me 2 . With these substitutions, (38.3) becomes 


= x i 
x dx 2 x 

X — 1 at x = 0, x = 0 at x = oo 


(38.5) 


The most accurate solution of Eq. (38.5) was computed by Bush and 
Caldwell 1 with the help of the differential analyzer, and is expressed in 
the form of a numerical table. 

Equations (38.4) show that the “radius” of an atom is inversely 
proportional to the cube root of the atomic number, if this radius is inter¬ 
preted to be that of a sphere that encloses a fixed fraction of all the elec¬ 
trons (see Prob. 1). These equations can also be used to show that the 
Thomus-Fermi approximation improves with increasing Z . The poten¬ 
tial at the atomic radius is proportional to Z\ so that a typical electron 
wave length is proportional to Z~K The distance over which the poten¬ 
tial changes by a definite fraction of itself is proportional to the atomic 
radius, or Z~ *. Thus the fractional change of the potential in an electron 
wave length is proportional to Z~\ and decreases with increasing Z, 
Moreover, since the number of electrons is equal to Z } the use of the 
statistical method is better justified as Z increases. 

Hartree’s Self-consistent Fields. The second method for obtaining 
a central field is due to Hartree. 2 This model assumes that each electron 
moves in a central field that can be calculated from the nuclear potential 
and the wave functions of all the other electrons, by assuming that the 
charge density associated with an electron is e times its position prob¬ 
ability density. The Schrodingcr equation is solved for each electron in 
its own central field, and the resulting wave functions made consistent 
with the fields from which they are calculated. Thus the /Hh electron is 
described by a normalized wave function w*(r fc ) that is a solution of the 
equation 


1 V. Bush and 8. IT. Caldwell, Phys. Rev., 38, 181)8 (1031). 

2 D. R. Hartreo, Proc. C'amb. Phil. 8oc. } 24, 111 (1928). 
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jin, — ^ J l u ^ r J')l 2 ^ ^vj w *( r *) = «* M *( r *) (38.0) 

jVA 

where = |r* — r*|. If there are Z electrons in the atom, (38.6) con¬ 
stitutes a set of Z simultaneous nonlinear integrodifferential equations 
for the Z functions w*(r*). It is therefore not feasible to solve these equa¬ 
tions directly, and Hartree used a method of successive approximations. 

A potential energy that approximately represents the second and 
third terms in (38.6) is assumed, electron wave functions computed, and 
new potentials for each electron found from these wave functions. This 
process is continued until the potentials are self-consistent to a high order 
of accuracy. The principal approximation made is the averaging of the 
potential energy given as the third term in (38.6) over the angles of 
r* to make it spherically symmetric. The solutions of (38.6) can then be 
expressed as products of radial functions and spherical harmonics. A 
further simplification is made so that the 2(21+1) or fewer electrons in a 
shell all move in the same potential and have the same radial wave 
function. 

It is apparent that the Hartree approximation neglects correlations 
between the positions of the electrons, since the entire wave function 
for all the electrons is assumed to be a simple product of one-electron 
functions 


^(ri>r2, ■ • • j r ^) — * * * Uz(tz) (38.7) 

It is also clear from (38.7) that antisymmetrized wave functions are not 
employed. The antisymmetry is considered only in so far as the quan¬ 
tum numbers of the one-electron states Uk are chosen in agreement with 
the exclusion principle. 

Connection with the Variation Method. We now show that the 
Hartree approximation results from an optimum variation calculation 
with the trial function (38.7). 1 The wave equation with inclusion of 
interelectronic interactions but neglect of spin-orbit terms (see below) is 


H\p = E\p 



(38.8) 


where j > k implies a double summation, over all different pairs of 
indices j and k. We wish to minimize the expectation value of II. 


1 J. C. Slater, Phys. Rev. } 36, 210 (1930); V. Fock, Zeits.f. Phyaik 61, 126 (1930). 
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From (38.7) and (38.8) we obtain 


f ■ ■ ■ j \f/H\l/dri • ■ ■ dr x 

= ^ J (“ V * “ 7 r) Uk( ' Tk)drh 

+ V V l f fli(r/)«*(r*) ^ Ui(ji)u k (t k )dTjdT k (38.9) 

3>k 

since the u k are normalized. The optimum ^ is obtained by varying 
each of the u k separately to minimize (38.9). The only dependence of 
(38.9) on a particular one-electron function u k is through the terms 


/ u k (Tk) (“ ^ v * “ 77 ) 


Uj{Tj)uk(tk) — Uj{tj)u k {x k )dTjdT k 
Tjk 


Iff 

j^k 

= J ilk{x k )H k u k (j k )dT k (38.10) 


The integral in (38.10) is the expectation value of the operator II k for 
the function Uk. From the discussion of Sec. 27, it follows that this is a 
minimum when Uk is an eigenfunction of Hu that corresponds to its low¬ 
est eigenvalue €&. 

H k Uk = tkUk (38.11) 


Since Eqs. (38.11) and (38.6) are identical, we see that the Hartree wave 
functions are the best, from the point of view of the variation method that 
can be written in the form (38.7). 

The energy associated with this wave function is just the integral 
(38.9), which can be written with the help of (38.6) 


/ 7 


\[/II\l/dT 1 


d t x = ek 

k 

-IW |u/(rj)| 2 |w*(rfc)| 2 drjdrk (38.12) 


The electrostatic interaction terms between electrons are counted twice 
in the summation over €*, and so have to be subtracted out to give (38.12). 
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Thus the energy of the atom is not just the sum of the e*, although each 
eu is roughly the energy of removal of the Hh electron. This last is not 
strictly true, since the removal of an electron alters the self-consistent 
fields, and hence the wave functions and e’s for the remaining electrons. 
However, e* is found to be an especially good approximation to the energy 
of removal in the case of an inner electron (X-ray level). 

Corrections to the Central-field Approximation. We now turn to the 
second problem mentioned at the beginning of this section, the correction 
of the approximate results obtained from the central field. Two terms 
are omitted in the central-field approximation: the difference between the 
actual electrostatic interaction between electrons and the average inter¬ 
action that is included in the central field, and the spin-orbit energy. 
The latter is an interaction energy between the spin and the orbital 
motion of each electron, and has the form 

2^r k )U- S* (38.13) 

k 

Here, L* is the orbital-angular-momentum operator r* X Pa of the frth 
electron, and has the properties of the M operator introduced in Sec. 24; 
the eigenvalues of L\ and Lkz are given in terms of the quantum numbers 
l and mi for the Hh electron as 1(1 + 1 )h 2 and respectively. S* is the 
spin angular momentum ihd k of the Hh electron that was introduced in 
Sec. 33. The function £(r) is given by 1 

« r > - ? < 38 ' u) 

in terms of the central-field potential energy F(r). 

In considering the effects of these terms, we shall assume that the 
perturbed eigenfunctions, which are linear combinations of various con¬ 
figuration wave functions, have only negligibly small amounts of all but 
one configuration mixed in them. From Eq. (25.9), it is apparent that 
this is the case if the interconfiguration matrix elements of the perturba¬ 
tion are small in comparison with the energy intervals between unper¬ 
turbed configuration energies. 

It can be shown that the part of the summation in (38.13) that 
includes electrons in full shells is zero, since the function £ is the same for 
all electrons in a shell and the contributions from electrons with opposite 
mi and m B cancel. ThuB the electrons in full shells can be ignored and the 

J L. H. Thomas, Nature , 117, 514 (1926). This energy is a consequence of rela¬ 
tivity, and is derived as such in Chap. XII. It was first obtained from the precession 
of the spin axis of the electron, part of which is of electromagnetic origin (Larmor 
precession) and part of which comes from relativistic kinematics (Thomas precession). 
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summation extended only over the remaining electrons. The case in 
which there is just one electron outside full shells is of interest in connec¬ 
tion with the ground state and low excited states of the alkali atoms, and 
will be discussed in some detail in the next section. For the present, we 
consider very briefly the more general situation, always assuming that 
each atomic state is based on just one configuration of the electrons. 

LS Coupling Scheme. There are, in general, a number of states that 
belong to the same configuration, and that are degenerate in the central- 
field approximation. These states differ in the assignment of m t and m a 
quantum numbers to the individual electrons. The theory of complex 
spectra consists in determining the linear combinations of such suitably 
antisymmetrized wave functions that diagonalize the perturbation to 
first order (see Sec. 25), along with the corresponding perturbed energy 
levels. 

The most usual situation is that in which the hitherto-neglected 
electrostatic terms are larger than the spin-orbit energy; this is called the 
RusselUSaundcrs caseA States of the same configuration can be clas¬ 
sified as eigenfunctions of any dynamical variables that commute with the 
Hamiltonian and hence are constants of the motion (see Sec. 23). When 
all perturbations are included, the only true constants of the motion are 
the total parity and the total angular momentum J of the electrons 

J= L + S = £ (U + S*) (38.15) 

k 

J is a constant because the angles that specify the orientation of tho 
atom as a whole, and that are the canonically conjugate variables to the 
components of J, do not appear in the Hamiltonian of an isolated atom. 
When the electrostatic perturbation is included but the spin-orbit energy 
neglected, the same argument can be applied to show that the total orbital 
angular momentum L and the total spin angular momentum S are 
separately constants of the motion. The individual S k need not be 
constants, even though no spin-dependent forces act in this approxima¬ 
tion, since the use of antisymmetric wave functions couples the spins to 
the electrostatic energy (see the discussion of the excited states of helium 
in Sec. 33). 

A state can be specified by the quantum numbers J, L , S, M, Mi, and 
Ms, which are connected with eigenvalues of angular-moment\un opera¬ 
tors through 

J2 = ./(,/ + 1 )h\ J m = Mh 

L 2 = L(L + 1 )h\ U * Mih 

S 2 - S(S + 1 )h\ S z = M,h 

1 H. N. Russell and F. A. Saunders, Astrophys. Jour., 61, 38 (1926). 


(38.16) 



278 QUANTUM MECHANICS [Chap. XI 

When the spin-orbit energy is neglected, the electrostatic energy separates 
states of different L; in some cases, only particular S values are permitted 
because of the exclusion principle. Only two of the other four quantum 
numbers are independent, so we can use either LSM lM $ or LSJM to 
specify a state. Because of the spherical symmetry of the Hamiltonian 
with respect to its space and spin parts separately, the energy is inde¬ 
pendent of the directional quantum numbers Ml and Mr, and there are 
(2 L + 1)(2S + 1) degenerate states. For given L and S f the states 
specified by J and M are linear combinations of those specified by Ml 
and M S} so that the same amount of degeneracy appeal’s in the LSJM 
representation. This is called the LS coupling scheme , since the indi¬ 
vidual L k are coupled together to form the total L , and the individual 
to form the total S. 

If now the spin-orbit energy is included, L and S are no longer con¬ 
stants of the motion, although J and M still are. However, we assume 
that states of different L and S are sufficiently well separated by the 
electrostatic energy so that their mixing due to spin-orbit energy can be 
neglected. This is analogous to the earlier assumption that different 
configurations are sufficiently well separated by the central field so that 
their mixing due to the electrostatic energy can be neglected. States of 
different J in the LSJM representation are now split apart by the spin- 
orbit energy; the energy is still independent of M , so that there are 2 J + 1 
degenerate states. A Russell-Saunders state is usually written in the 
form 4 Dj, where the superscript is the multiplicity 2S + 1, the letter 
(now capitalized) is thei value according to the code given earlier in this 
section, and the subscript is the J value; in this case S — L = 2, ami 
J = i- Since J = L + S, the argument at the end of Sec. 24 shows that. 
J can only be one of the numbers L + 8, L + l S Y — 1, . . . , \L — S\. 

Selection Rules, The selection rules in the Russell-Saunders case 
can be obtained from the discussion of Sec. 37. Only one electron is 
involved in a transition, so that the configuration changes through il 
change in one of the Vs by one unit; this also changes the parity. Since 
the electric-dipole moment does not involve the spins, and the spin func¬ 
tions for different S are orthogonal (see Prob. 4), S does not change in ail 
allowed transition. The conservation of angular momentum between 
atom and radiation field further requires that J and Ij each change by 
1 or 0. Transitions between states both of which have J = 0 are strictly 
forbidden. 

Intersystem lines that join states of different multiplicity (change in 
S) sometimes occur, and indicate a partial breakdown of LS coupling. 
The very intense mercury resonance line at 2,537 A is an intorsystnm 
line: a Pi -> hSo. This transition is allowed so far as the changes in J, L, 



Sec. 39] 


ATOMS, MOLECULES , AND ATOMIC NUCLEI 


279 


configuration and parity are concerned, but not as regards the change in 
S. The IPi state is partially mixed by the spin-orbit energy with a 
higher singlet (£ = 0) state of the same J and parity, and this makes an 
electric-dipole transition possible. 

jj Coupling Scheme. The opposite approximation to that involved 
in LS coupling assumes that the spin-orbit energy is large in comparison 
with the electrostatic energy. If the latter is neglected, each electron 
can be characterized by the quantum numbers nljm rather than nlmtm a , 
where (L* + S*) 2 = j(j + 1)A 2 and Lk Z +* 8b * = wA. The electrostatic 
energy then splits apart states of different J. 

This is called the jj coupling scheme , since the orbital and spin angular 
momenta of the individual electrons are coupled together to form j’ s, 
from which the states are built up. It is mainly of interest in heavy 
atoms, where the large V(r) makes the spin-orbit energy (38.13) the 
dominant perturbation. 


39. THE ALKALI ATOMS 

The ground-state configuration of an alkali atom consists of a series 
of full shells followed by a single s electron, and so is 2 $j. The inner rare- 
gas configuration is so stable that all but quite high excited states of the 
atom involve only the valence electron. Thus the alkalis can be treated 
to quite good approximation in terms of a model in which a single electron 
moves in a spherically symmetric non-Coulomb potential energy V(r). 
In this section wo calculate the energy levels and the intensities of allowed 
transitions in the absence and presence of an external magnetic field. 

Doublet Separation. The configuration of an alkali atom can be 
specified by a single pair of quantum numbers nl. Since there is only 
one electron, the perturbing electrostatic term mentioned in the last sec¬ 
tion does not appear. In the absence of external fields the Hamiltonian, 
including the spin-orbit energy (38.13), is 

// = - £ V* + 7(f) + t(r)L • S (30.1) 

where £(r) is given by (38.14). As in Sec. 38, we neglect the mixing of 
different configurations produced by the spin-orbit, energy, and regard 
this term as a perturbation that removes the mi?n a degeneracy within each 
configuration. The total angular momentum J = L + S of the valence 
electron is a constant of the motion (see Prob. 5), so that states can be 
designated by jm instead of Wi?/?., where J 2 = ;/(;/ + 1)A 2 and J z = wA. 
The states of different j have different energies, but there is still a (2;/ + l)- 
fold degeneracy due to rn. The removal of the m degeneracy by a mag¬ 
netic field is discussed later in this section. 
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The difference in energy between states of different j is due to the 
L * S term in (39.1), and can be found from its expectation value or 
diagonal matrix element [see Eq. (25.8)]. We have the operator relation 


J 2 = (L + S) 2 = L 2 + S 2 + 2 L • S (39.2) 

Since l, j, and s are all good quantum numbers (s = -J- for one electron), 
Eq. (39.2) can be solved for the diagonal matrix element of L * S 

(L ■ S)*j,z 3 * = ib'U + 1) - Kl + 1) - IW (39.3) 

Now if l is different from 0, j can be either l + i or l — -J. Thus the 
first-order perturbation arising from f(r)L ■ S is 


iltni if j = l + $ 

+ l)f*z if j = l — i 

fi = A s / 0 “|/2^)| s f(r)r 2 *, l > 0 


(39.4) 


where R n i(r) is the normalized radial part of the unperturbed eigenfunc¬ 
tion associated with the nl configuration. Since V (r) represents an 
attractive potential energy, £(r) given by (38.14) is positive and f*z is 
positive. Thus (39.4) shows that the state with higher j has the higher 
energy. The pair of states is called a doublet; the doublet structure 
characterizes all the moderately excited levels of the alkali atoms except 
those for which l = 0, in which case j can only be 

The doublet separations can be calculated from (39.4) if the radial 
function is known. We can get a rough estimate of their dependence on 
n by using the hydrogenic wave functions given in Eq. (16.24), and assum¬ 
ing that 7(r) has the Coulomb form —Ze^/r. Substitution into (38.14) 
and (39.4) gives, with the help of the generating function (16.21) for the 
associated Laguerre polynomials, 


. 1 WM . 

2mVJo r R »iW dr 
c 2 h 2 Z 4 

" 2niVa ii 3 nH(l + M + T) (39 ' 5) 

This is valid only for Z > 0; the singularity in £(r) at r = 0 makes the 
integral for f n0 diverge there, so that the perturbation approximation in 
not valid. It follows from (39.4) and (39.5) that the doublet separation in 
proportional to ft -3 , and this is in fair agreement with observation. The 
absolute value of the doublet separation and its dependence on l are not 
given at all by this simple theory, since the effective Z is difficult to 
estimate and depends markedly on l because of penetration. 1 


1 The effect of the spin-orbit interaction on the energy levels of hydrogen is found 
in the next chapter as part of an exact relativistic calculation. 
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Doublet Intensity. We now calculate the relative intensities of the 
two lines of the allowed doublet 2 P 8 —> 2 £* and 2 P* —> 2 S 4 , under the 
assumption that the radial wave functions are the same for the two excited 
2 jP states. Transitions of this type give rise to the principal series in the 
alkali spectra.- From Eq. (36.22), the spontaneous transition probabili¬ 
ties, and hence the observed intensities if the two P states are equally 
likely to be occupied, are proportional to the squares of the dipole matrix 
elements. 1 

The dependence of the two excited 2 P states and the ground 2 S state 
on the angular and spin coordinates of the electron is obtained by finding 
linear combinations of products of the four spherical harmonics 7 i( i(0,<£), 
7i iO (0,<f>), 7i,_ 1 (0,*) and To,o(0,tf>), and the two spin wave functions (+) 
and ( —), that are eigenfunctions of J 2 and J z (see Secs. 14 and 33 for dis¬ 
cussion of the angle and spin functions). These combinations can be 
obtained from the spin functions for three electrons given at the end of 
Sec. 33. We replace S x by S, S 2 + S 8 by L, and the spin functions (33.6) 
for the second and third electrons by the spherical harmonics Y 1 , 1 , 
Fi t o, 7i ( -i, and 7 0 ,o, respectively (see Prob. 3, Chap. IX). This gives at 


once 


m = % (+)7i,i 

* 3->[2K+)7i, 0 + (—)7 lil ] 

-i 3-*[2K“)7i,o + (+)7 W ] 
-i (-)*Vi 

m » i 3-i[(+)7 lt o-2K-)7i tl ] 

-* 3-»[(-)7 li0 -2K+)7 1( _i] 

* (+)7 0 ,o 




m 


(39.6) 


The wave functions (39.6) can be used to calculate the matrix ele¬ 
ments of x = r sin 0 cos <£, y = r sin 0 sin <£, and z = r cos 0. We assume 
that the radial functions associated with (39.6) are all the same, so that 
the radial part of the matrix-element integral is a common factor through¬ 
out. The angle parts of the integrals are easily evaluated by making use 
of the explicit expressions for the 7 ; s in terms of 0 and <f> given in (14.16). 2 
The products of spin functions follow the simple rules (+)*(+) = 1, 
(■—)*(+) = 0, etc. In this way we obtain the following values for the 
squares of the magnitudes of the indicated matrix elements, expressed in 
units of tV of the common radial factor: 

1 The energy difference between the two upper states is so small that the to 8 factor 
in (36.22) does not affect the intensity ratio appreciably. 

a In the general case in which VVs with l > 1 arc involved, it is often easier to 
use a formula for the integral of the product of three spherical harmonics given by 
J. A. Gaunt; see Condon and Shortley, op. cit., p. 176. 
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(39.7) 


Similar results are obtained for the transitions that start from m = — -J 
and — f; altogether, they confirm the m selection rules of Sec. 37. 

It follows from (39.7) that the sum of the intensities of all the lines 
that originate on each of the four 2 P S states is equal to 6, in the above 
units. It is to be expected that these sums are equal since the four values 
of m differ only in the orientation of the angular momentum, and this 
should not affect the intensity. However, the total intensity from each 
of the two 2 P* states is also equal to 6. The equality of total intensities 
from each state formed from a given L and S is a general property of LS 
coupling; this makes the observed intensity, which is that from all the 
states that are degenerate with respect to 7ft, proportional to 2J + l. 1 
In the example considered here, the two lines of the doublet have inten¬ 
sities in the ratio 2:1. This is observed for the lowest doublets of the 
alkalis, although for the higher doublets, the intensity ratio exceeds 2. 
This is because the spin-orbit energy actually mixes different configura¬ 
tions ( 2 P states with the same j but dilferent n ); the amount of mixing is 
different for the two j values, so that the two radial functions are not the 
same. A small admixture of the low-intensity upper states in the high- 
intensity lowest 2 P states has little effect, whereas in the opposite case, 
there is a large effect on the doublet intensity ratio. 2 

Effect of a Magnetic Field. 8 We now consider the effect of a magnetic 
field on the energy levels and transition intensities of an alkali atom. A 
constant magnetic field H can be represented by the vector potential 


A = (iH X r) (39.8) 

since H = curl A. The divergence of (39.8) is zero, so that the terms 
involving A that appear in the Hamiltonian (23.24) are 


ieh 


— : A * grad + 

me 2 me 1 


^A 2 


= ~ 2 ^ (H><r) - p+ » (HXr) -( HXr) 


2 me ^ ^ 8me 2 


|_|2 r 2 g J n 2 0 


(39.9) 


1 Condon and Shortley, op. at., p. 238. 

2 E. Fermi, Zrits.f. Phymk, 69, 680 (1929). 

3 W. Heisenberg and 1\ Jordan, Zeils. f. Physik., 37, 263 (1926). 
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where L = r X p and 0 is the angle between r and H; e is the electronic 
charge, and hence is a negative number. 

The electron also has an intrinsic magnetic moment in the direction of 
its spin axis. The magnitude of this moment can be determined from 
comparison between experiment and the theory of the Zeeman effect 
presented below, and is in agreement with the value deduced from Dirac's 
relativistic theory of the electron (see Chap. XII); it is ch/2mc, or e/mc 
times the spin angular momentum of the electron. This is twice the 
ratio of magnetic moment to angular momentum of a classical charge 
distribution for which the ratio of charge to mass density is constant. 
The magnetic moment is (e/mc) S, and the extra energy in a magnetic 
field is 

- — H • S (39.10) 

me 

The ratio of (39,9) to the kinetic energy is quite small for magnetic 
field strengths commonly attainable in the laboratory (see Prob. 7). It iH 
therefore permissible to use perturbation theory to find the effect of the 
H terms on the wave functions and energy levels. In most cases, only 
the linear terms need be considered. However, for very strong fields 
and large orbits, the quadratic terms can become of interest (see the dis¬ 
cussion of the quadratic Zeeman effect below). Also, the diamagnetic 
susceptibility can be obtained from the terms in the energy that are 
proportional to H 2 . 

Weak-field Case. For the present, we consider only the first-order 
effects of H. The Hamiltonian (39.1) then becomes, with (39.9) and 
(39.10), 

H = ~ £ V ' + 7 W + • S + + 2 *->.« - ” (39-11) 

where the field is along the z axis. The magnetic field can now be clas¬ 
sified as weak or strong according as the last term in (39.11) is small or 
large in comparison with the spin-orbit energy. The Zeeman effect usu¬ 
ally refem to the weak-field case, and the Panchen-Bwk effect to the 
strong-field case, although the term Zeeman effect is sometimes used to 
include all magnetic effects. 

In the weak-field ease, we can make use of the wave functions (39.0), 
which are eigenfunctions of J 2 and J s . It is easily verified that the 
magnetic energy e(Jj c + 2/?,) = e(J a + Si) lias matrix elements between 
states of different j, but not between states of the same j and different m. 
We neglect the former, because of the relatively large energy separation 
between stai.es of different j. Thus the magnetic energy is diagonal with 
respect to m for eachand shifts the energy of each of the states (39.0) 
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by its expectation value for that state. In each case, J , is diagonal, so 
its expectation value is rah. The expectation value of S, for the ®P| 
state with m = i, for example, is 

J73-*[2*(+)*?i,o + (-)*?u]*for.3-*[2»(+) 7 ll0 + (-)Fi.i] sin 9ded<t> 

= %fj [2K+)'*?i.o + (—)*?i,i][2 i (+)F : 1,0 - sin AZAty 

= ^(2-D=| 

with the help of (33.5) and the orthonormality of the spin functions and 
the Y’&. Thus the magnetic energy of this state is + i) = fah- 
This and the similar results for the other states (39.6) can be expressed in 
terms of the Lands g factor; the magnetic energy is 

ehmg /on i o\ 

g = i for 2 P h g = * for *P 4 , g = 2 for 2 £ 4 1 ; 

The weak-field transition intensities are given directly by (39.7). 
According to the discussion of Sec. 37, the radiation from the transitions 
in which m changes by unity is circularly polarized when viewed along 
the field and linearly polarized perpendicular to the field when viewed in 
the xy plane; these are called the <r components (from the German senk - 
recht). When m does not change in a transition, the radiation does not 
appear along the field, and is polarized parallel to the field (nr components) 
when viewed in the xy plane. For observation in the xy plane, the tt 
intensity is proportional to |z| 2 in (39.7) and the a intensity is proportional 
to either \x\ 2 or \y\ 2 (but not to their sum). 

Strong-field Case. If the magnetic energy is large in comparison 
with the spin-orbit energy in (39.11), the field is said to be strong. In 
this case the states within a given nl configuration are better specified by 
mi and m, than by j and m as in (39.6), The magnetic energy is then 
diagonal and has the value 

eh(mi + 2 ra a ) (39.13) 

If the spin-orbit-energy is neglected for the moment, the eight wave func¬ 
tions that correspond to (39.6), and their energy shifts (39.13), are 

(+)F U 2 eh 
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In the event that the magnetic field is very strong, the spin-orbit 
energy is most simply treated as a perturbation on the wave functions 
(39.14). We consider instead the general case, which includes all relative 
magnitudes of the magnetic and spin-orbit energies. This is done by 
working with the matrix of the last two terms in (39.11) in either of 
the representations (39.6) or (39.14). The eigenvalues of the matrix are 
the energy levels, and the transformation that diagonalizes the matrix 
gives the wave functions in accordance with the discussion of Sec. 22. 
We start from (39.14), and notice at once that the two 2 S wave functions 
are the same as the 2 S* functions of (39.6). We ignore the effect of the 
spin-orbit energy on these two states, since it does not shift them with 
respect to each other; the energy shifts due to the magnetic field are 
±eft. Similarly, the first and last of the six 2 P wave functions are the 
same as the 2 P a functions of (39.6) with m = ±4; their energies are 
if ± 2fift, where f is given by (39.4). 

The four remaining 2 P wave functions combine in pairs, according to 
whether m = mi + m 8 is equal to i or — i. It is enough to consider 
just one of these pairs, say that for which m = i: (+)7i,o and ( —)7i,i. 
The matrix of the magnetic and spin-orbit energies in the representation 
specified by these two states can be found with the help of the angular- 
momentum matrices (24.15). 


/ eh 2-*f\ 

\2-*r -if/ 


(39.15) 


In accordance with the discussion of Eq. (21.19), the eigenvalues of the 
matrix (39.15) are found by solving the secular equation 

|eh - X 2-if 


2-if 


-if -x 


= X 2 + (if — eft)X — if (eft + f) — 0 


In this way we obtain for the energy shifts of these two states 


x± = i[eft - if ± (e 2 ft 2 + eftf + if 2 )*] (39.16) 

In the weak- and Hlrong-field limits, the upper and lower signs in (39.16) 
lead to 


X+—► if + left and 
X + —> eft and 


X- —► — f + ieft for --—►() 

1 (39.17) 

X_— - if for 4-0 


Equations (39.17) show that the state that corresponds to the upper sign 
in (39.16) is the weak-field state j = f, m = i and the strong-field state 
mi =b 0, m fl = i. Similarly, the lower sign in (39.16) corresponds to the 
weak-field state j = i, m = i and to the strong field state rrii = 1, 
m, = - i. 
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The transition intensities can be found in the general case by calcula¬ 
ting the matrix elements of x, y , and z with the help of the eigenfunctions 
of £(r)L • S -b c(I/* + 2S g ). These eigenfunctions are the first, sixth, 
seventh, and eighth of (39.14), and linear combinations of the other four 
functions that are obtained from the matrix that diagonalizes (39.15). 

Quadratic Zeeman Effect. For very strong magnetic fields and large 
orbits or n values, effects of second order in H become appreciable. From 
(39.5), it is apparent that the effect of the spin-orbit energy becomes very 
small, for large n , and a useful approximation is obtained by neglecting 
this part of the energy entirely. In this case the electron spin commutes 
with the Hamiltonian, so that m a is a constant of the motion, and the spin 
can be ignored. The Hamiltonian (39.11) is then replaced by 

H = - V 2 + V{r) +eL, + sin 2 9 (39.18) 

Since L n — — ihd/d<}> commutes with (39.18), ra { is a good quantum 
number, and the only effect of the term tL, is to displace each energy level 
by the amount ehmi. Thus for large n, we need only be concerned with 
the effect of the last term H' s= sin 2 d in (39.18), for particular 

values of m 2 and 

It follows from the work of Sec. 16 that the effective radius of a hydro¬ 
gen atom is roughly proportional to n 2 . For alkali-atom states of large 
n, V(r) has practically the Coulomb form, and the wave functions are 
very nearly hydrogenic functions. Thus H' increases about as n 4 . This 
means that n is no longer a good quantum number for sufficiently large n. 
For smaller n, l may not be a good quantum number since H' has off- 
diagonal matrix elements between states of different l, and the unper¬ 
turbed energies of these states lie close together (they fail to be degenerate 
only because the wave functions for the smallest values of l penetrate 
the inner full shells). In this region, the perturbed energy levels can be 
found by diagonalizing the matrix of H' for given values of n, mi, and 
m B , when the n — |mj| rows and columns are labeled by L The structure 
of the H' matrix can be inferred from Gaunt’s formula (footnote 2, page 
281); since sin 2 0 can be expressed in terms of spherical harmonics of order 
0 and 2, the only nonvanishing matrix elements H[ v are for l — V = 0, ± 2. 
Thus the H' matrix has the form (if for example, mi = 0) 

0 tf' 02 0 0 

0 H' n 0 H’ u 0 

ffio 0 H' %% 0 h; 4 

0 JF al 0 Hi 3 0 

0 0 H' 42 0 H' 44 

1 L. I. Schiff and H. Snyder, Phys. Rev., 66, 69 (1939). 
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The matrix (39.19) is equivalent to two independent matrices, one 
for even and the other for odd Z, each with about rows and columns. 
Direct diagonalization of these would be quite arduous for large n . How¬ 
ever, the resulting energy levels are so close together that they cannot 
be resolved spectroscopically, so there is little reason to determine the 
individual levels. What can be observed is the aggregate of transitions 
between the 2 S ground state (l = 0) and the group of states that are 
obtained by diagonalization of (39.19); these appear to be a single 
broadened “line.” Allowed transitions occur only by virtue of the state 
with l = 1 that is mixed into each of the eigenfunctions of (39.19), so 
that mi can be only 0 or ±1. This makes it possible to find the center of 
gravity of this line and its mean-square breadth without diagonalizing 
if', as we now show. 

The unperturbed wave functions can be chosen so that if' is a real 
matrix. Then the unitary matrix S that diagonalizes H' can be real, so 
that Eq. (21.20) or Eq. (22.6) can be written 

SH'S* = E (39.20) 

where E is diagonal. In terms of matrix elements, this equation is 

SikH'kiSji = Ei8u 

The new eigenfunctions m that correspond to the energy eigenvalues Ei 
are given in terms of the unperturbed wave functions Vi by (22.3) 



If now we neglect the dependence of the radiative transition probability 
on energy over the small range of energies involved in this group of 
states, the transition probability is proportional to the square of the 
amount of v t in each , or to a?*?. Thus the energy levels Ei should bo 
weighted in proportion to Si{. The center of gravity of the group of 
perturbed energy levels is given by 

E m = ^ EM = tf'n 

i 

since (39.20) can be inverted to give IV = S*ES. In similar fashion, 
the mean-square breadth of the line is 

2 (Ei - = 2 

i 

= y n[\ - eu = an 

T 


l 
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Thus only two of the matrix elements of H' need be calculated. It is 
apparent that both the displacement (apart from the factor ehmi) and 
the breadth of the line are proportional to H 2 . 

40. MOLECULES 

Molecules are considerably more complex in structure than atoms, 
and correspondingly less has been accomplished in the quantitative 
application of quantum mechanics to molecular problems. In this sec¬ 
tion, a general account of the nature of molecular energy levels is followed 
by a simple explicit calculation for the hydrogen molecule and a somewhat 
more general treatment of diatomic molecules. 1 

Classification of Energy Levels. The simplifying feature that is the 
basis of all molecular approximations is the large ratio of nuclear mass to 
electron mass. As we shall see shortly, this implies that the energy 
associated with the motion of the nuclei is much smaller than that associ¬ 
ated with the motion of the electrons about the nuclei. Since the period 
of a motion is of the order of h divided by its energy, the nuclear periods 
are correspondingly longer than the electronic periods. It is then a good 
approximation to regard the nuclei as fixed in calculating the electronic 
motion. Moreover, the nuclear motion can be calculated under the 
assumption that the electrons have their steady motion for each instan¬ 
taneous arrangement of the nuclei '(adiabatic approximation). 

The nuclei are expected to have a stable equilibrium arrangement 
somewhere between a completely collapsed structure (which is unstable, 
since the nuclei are positively charged and repel each other at short dis¬ 
tances) and a completely dispersed structure (which is not the most stable 
structure if a molecule exists). The nuclear motions can then be classified 
into translations and rotations of the quasi-rigid equilibrium arrange¬ 
ment, and internal vibrations of the nuclei about equilibrium. As with 
atoms, the translational motion is the same as that of a free particle 
[see the discussion of Eqs. (16.5)], and gives rise to no nonclassical 
features. 

We thus arrive at a classification of molecular energy levels into 
electronic , vibrational , and rotational types, and proceed to estimate their 
relative orders of magnitude. Suppose that the molecule has linear 
dimensions of order a. Then the energy E 9 associated with the motion 
of a valence electron (one that occupies roughly the whole of the molecular 
volume, rather than one that is bound in an inner shell close to a nucleus) 

1 For more dotailed discussions, see G. Herzberg, “Molecular Spectra and Molec¬ 
ular Structure” (Prentice-Hall, New York, 1939); L. Pauling and E. B. Wilson, Jr., 
“Introduction to Quantum Mechanics,” Chaps. X, XII, and XTII (McGraw-Hill, 
New York, 1935). 
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is of order ft 2 /ma 2 , where m is the electronic mass. This can be inferred 
by an argument like that given near the beginning of Sec. 9, by noting 
that the momentum uncertainty of the electron is at least of order ft/ a, so 
that its minimum kinetic energy is h 2 /ma 2 . We thus obtain 


E e 



(40.1) 


For values of a of the order of a few angstrom units, this corresponds 
to transition frequencies in the visible and ultraviolet regions of the 
spectrum. 

To estimate the vibrational energy, we regard each of the normal 
modes as a harmonic oscillator with which is associated a mass M and a 
stiffness constant K 0 . M will be of the order of a typical nuclear mass. 
Kq can be estimated by noting that a displacement along a normal mode 
by the order of the molecular size a must produce an energy change of 
the order of the electronic energy E ey since such a large displacement 
would produce a substantial distortion of the electronic wave function; 
we thus put K q E e /a 2 . Then the energy E v associated with a fairly 
low mode of vibration is, from (13.8), 


E v 


, {K 0 \* ¥ 

\M) 2 



(40.2) 


where use has been made of (40.1). E v is roughly a hundred times smaller 
than E 0 , and corresponds to transitions in the near infrared. 

The rotational energy E r can be estimated from the moment of inertia 
of the molecule, which is of order Ma 2 . As would be expected, the 
angular momentum of a fairly low mode of rotation turns out to be of 
order ft, so that 


E r 


ft 2 

Ma 2 



(40.3) 


This is about a hundred times smaller than E V} and corresponds to transi¬ 
tions in the far infrared. 

It might be expected from Eqs. (40.2) and (40.3) that the electronic, 
vibrational, and rotational energy levels can be obtained as successively 
higher orders in an approximation that is based in some way on the small 
ratio m/M (which is usually in the range 10“ 3 to 10“~ 4 ). This was shown 
to be the case by Bom and Oppenheimer. 1 They used as expansion 
parameter the ratio of a typical nuclear vibrational displacement to the 
internuclear distance (which is of order a). An oscillator of energy 


1 M. Bom and J. R. Opponheimer, Ann. d. Phy&ik , 84, 467 (1927). 
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E v and stiffness constant K Q has a displacement of order 



so that the expansion parameter is 

(l)‘~(5)‘ <40 ' 4 > 

In terms of this, the electronic energy is of zero order, the vibrational 
energy of second order, and the rotational energy of fourth order; the 
first- and third-order energies vanish. 

Wave Equation. The time-independent Schrodinger equation for a 
molecule is readily written down 

+ <40 - 6) 

»«1 1 

There are n electrons and N nuclei, and V is the sum of the electrostatic 
interactions between all pairs of them. It is apparent that the nuclear 
kinetic-energy terms are of fourth order in the expansion parameter 
(40.4). If they are neglected, the wave function \p involves the nuclear 
coordinates R,- only parametrically, and (40.5) is a wave equation in 
the ?i for the motion of the electrons with respect to nuclei that are fixed 
in space. In this case, \p is approximately and corresponds to 

the energy eigenvalue Z7(R,-). The nuclear motion can then be found by 
regarding t/(R?) as a potential function and using it to obtain a nuclear 
wave function w(R*). 

We therefore write \p in the form 

^(r»,R,0 = UK } (ti)w(Rj) (40.6) 

where u satisfies the equation 

(~ X v,?+ v ) MR,(r<)=c/(R ’' )wR ' (r<) (4o - 7) 

1 

For each arrangement of the nuclei, U (R/) is obtained as an eigenvalue 
of Eq. (40.7). There will, in general, be several solutions that corre¬ 
spond to different electronic states of the molecule; care must be taken to 
ensure that u and U change continuously with R,-, especially if the system 
is degenerate. Substitution of (40.6) into (40.5) gives, with the help of 
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(40.7), 

N 

[-Xar. 7 > + P(B,) ]*"* 

y-i 

which can be rewritten 

N 

W*) [ - ^ + U(Ri) - 2 ?] w(Ri) 

y-i 

N 

= ^ ^ KRi)V?u H ,(r<) + 2 grad* w(R,) ■ grad,- u R ,(r ( )] (40.8) 

If now the dependence of u on R* is neglected, the right side of (40.8) 
drops out, and an approximate wave equation for the nuclear motion is 
obtained: 

N 

[ ■- V V}+ U(fc)] w(R#) = Bw(K) (40.9) 

y-i 

The neglect of the grad, u terms derives physically from the smallness of 
the amplitudes of the nuclear motion in comparison with the equilibrium 
intemuclear distances [smallness of the expansion parameter (40.4)]; 
this implies that the electronic part u of the wave function does not change 
much as the nuclei move. Born and Oppenheimer have shown formally 
that this approximation is justified so long as not too high vibrational 
and rotational modes are excited. 

The Hydrogen Molecule. It is clear from the foregoing discussion 
that two distinct problems arise in connection with molecular structure. 
The first is the solution of Eq. (40.7) to obtain electronic wave functions 
and a potential energy function U(Rj) of the nuclear coordinates. The 
second is the solution of (40.9) for the nuclear motion. The first prob¬ 
lem can be solved only in the simplest cases. As an example, we now 
consider in outline an approximate solution for the hydrogen molecule 
due to Heitler and London. 1 Following this, we discuss the solution of 
Eq. (40.9) for a general diatomic molecule, making simple assumptions 
concerning the potential energy (/. 

The only nuclear coordinate R/ that appears in (40.7) in the case of 
the hydrogen molecule is the magnitude 11 of the distance between the 
two hydrogen nuclei. The Hamiltonian is that given in Eq. (27.11) 


1 W. Hcitlur and K. London, Zeiln.J. Phyaik , 44, 455 (1027). 
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(see Fig. 24); however, R is no longer large in comparison with 



so that the approximations implied in (27.11) and (27.12) are no longer 
useful. Nevertheless, an approximate wave function based on a simple 
product of two ground-state hydrogen-atom functions gives remarkably 
good results. The reason for this is that exchange degeneracy (see Sec. 
32) is taken into account: the degenerate wave functions for which elec¬ 
tron 1 is on nucleus A and electron 2 on nucleus B, and for which electron 
1 is on nucleus B and electron 2 on nucleus A, are both used at once. 
The new feature of the work of Heitler and London was the recognition 
that an appropriate linear combination of unperturbed degenerate wave 
functions gives a significantly lower energy than the separate wave 
functions; it is the basis of the present-day theory of homopolar binding in 
molecules. This property of degeneracy iB sometimes referred to as 
resonance . Just as an interaction between two classical oscillators that 
are in resonance (same unperturbed frequency) gives rise to a normal 
mode that has a lower frequency (and also one that has a higher fre¬ 
quency), so does an interaction between two resonant (degenerate) 
states in quantum mechanics give rise to a lower energy eigenvalue (as 
well as to a higher one). 1 There may of course be more than two degen¬ 
erate unperturbed states, and the degeneracy need not be of the exchange 
type. 

Potential-energy Function. Equation (40.7) for the hydrogen mole¬ 
cule is 


[H - J7(22)]^(r!,r 2 ) - 0 

H = - N (Vf + VI) + e 2 (4 + ----- 

2m \R r i 2 j-u r«„ 


±.±\ <4010) 
Tin T 2A ) 


We wish to base our approximate calculation of U(R) on the approximate 
wave functions 


i/i(ri,r 2 ) = u a (ti)ub(t 2 ) 
'w 2 (r 1 ,r 2 ) = ua(t 2 )ub(ti) 


(40.11) 


where Ua and u B are ground-state hydrogen wave functions [wioo in the 
notation of (16.24)] based on nuclei A and 7?, respectively. It must 
first be noticed that the U\ and u 2 of (40.11) are eigenfunctions of different 
unperturbed Hamiltonians, so that the degenerate perturbation theory 
of Sec. 25 is not applicable. This is unlike the situation with the helium 


1 This use of the word resonance is only remotely related to that which appeared in 
Bee. 19 in connection with scattering. 
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atom [see the discussion of (33.7)], where the two exchange-degenerate 
wave functions are solutions of the same unperturbed Hamiltonian. 

We can, however, use the variation method of Sec. 27, in which case 
it is natural to adopt as the trial function an arbitrary linear combination 
of U\ and u 2 

^(ri,r 2 ) = ui(t h u) + Au 2 ( ri,r 2 ) (40.12) 

where A is the variation parameter. Substitution of (40.12) into (27.5) 
gives 


TTfT)\ ^ (1 + A 2 )H 11 + 2AHu 

U( - R) = -1 + A 2 + 2 Ay ' '* 

Hu = H 22 = IJuiHuidT idr 2 , H u 


y s fjuiudndn ^ 

= H 2 i s f SuiHu 2 dTidT 2 


These equalities between matrix elements are easily established with the 
help of Eq. (22.10) when it is remembered that the u’s are real, and that 
H is Hermitian and symmetrical in the two electrons. 

The matrix elements and 7 depend on R. For any particular value of 
R, the derivative of the right side of (40.13) with respect to A is 


2(1 - A 2 )(ff 12 - yH u ) 
(1 + A 2 + 2Ay) 2 


which is zero for A = ±1. Since the right side of (40.13) is equal to 
Hu when A is — <x>, 0, and + go, one of the points A = ± 1 must be a 
minimum and the other a maximum. The integrals in (40.13) can be 
expressed in terms of tabulated functions, and the minimum expectation 
value of H is obtained with A = +1: 


U(R) g 


= Ui + u 2 
Hu + Hu 


1 + 7 


(40.14) 


The upper limit on U{R) given in (40.14) has the general appearance that 
is characteristic of the internu clear potential energy of a diatomic mole¬ 
cule (see Fig. 30) and is in good agreement with experiment. 1 Since \p in 
(40.14) is symmetric in an interchange of the space coordinates of the two 
electrons, it must be multiplied by the antisymmetric singlet spin func¬ 
tion given as the last of lOqs. (33.0). 

It is interesting to compare the symmetiy characters of the ground 
state of the hydrogen molecule and the excited states of the helium atom 
considered in Sec. 33, from a physical point of view. Because of the 
exclusion principle, electrons must be in different space states if they 
have parallel spins, and hence tend to keep away from each other. In the 


1 Pauling and Wilson, op. cit ., Sec. 43a. 
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excited ls2s state of helium, for example, this reduces the electrostatic 
repulsion of the electrons and lowers the energy. Thus the triplet states 
of helium tend to lie lower than the singlet state of the same configuration. 
(The situation is different for the ground state, since only the singlet state 
can exist for the Is 2 configuration.) In the ground state of the hydrogen 
molecule, on the other hand, the lowest energy (strongest binding) is 
obtained when the electrons tend to concentrate between the two nuclei, 
since then the repulsion between the electrons is more than compensated 
by the attraction of both nuclei for each electron. This occurs when the 
electrons can occupy the same space state and hence when they have 
antiparallel spins. Thus it is the singlet state that leads to a stable 
molecule. 

The Morse Potential. We now turn to diatomic molecules in general, 
and consider the nature of the solutions of Eq. (40.9) for the nuclear 
motion. If the nuclei have masses M i and M 2 and their relative position 
vector R has polar coordinates 12,0,0, the equation for the relative motion 
becomes [see Eqs. (16.5)] 

“ v2 + U( - R) ] w( - R ’ d >^ = Bw&M) ( 40 - 15 ) 

where M = Miikf 2 /(Mi + -Ms) is the reduced mass. 

It has been found by experience that the potential-energy function 
for the lowest electronic states of actual diatomic molecules can be repre¬ 
sented quite accurately by a simple analytic function that contains three 
adjustable parameters 


U(R ) = U o e 


2(rt-fio) 


(R-Ro) 


- 2e 


(40.16) 


Equation (40.16) represents the Morse potential, 1 and is drawn in Fig. 30. 
U approaches zero exponentially for large R, has the minimum value 
— Uo at R = Ro , and becomes large and positive as R approaches zero if 
the “breadth” a of the attractive region is somewhat smaller than the 
equilibrium distance Ro. 

Figure 30 has the general appearance that would be expected for a 
diatomic molecule. The zero of energy is arbitrarily chosen to be zero 
when the neutral atoms are far apart; then U becomes negative at first 
as R decreases, because of the van der Waals attraction. 2 For smaller R 
this is replaced by the much stronger Heitler-London resonance attrac- 


1 P. M. Morse, Phys. Rev., 34, 57 (1929). 

2 One of the inaccuracies of the Morse potential is its replacement of the 1 /R* 
van der Waals term (see the end of Sec. 27), by an exponential; however, the behavior 
of U at such large R has little influence on molecular energy levels. 
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tion. Ab R continues to decrease, the close approach of the nuclei (or 
ionic cores) gives rise to a repulsion that causes U to increase and eventu¬ 
ally become large and positive. 1 

Rotation and Vibration of Diatomic Molecules. Equation (40.16) 
can be separated in spherical coordinates, as was Eq. (14.1), to give 

w(RM) = ^ Y KMk (6,4>) 


K and M K are the angular-momentum quantum numbers that are 
analogous to l and m , respectively, for a single particle in a central field. 



The radial equation is 


h 2 d 2 x 
2MdRi 


+ W(R) X = Ex 


(40.17) 


W(R) = U(R) + 


h 2 K(K + 1) 
2 MR 2 


K - 0 , 1 , 2 , 


Equation (40.17) is the wave equation for the ono-dimensional motion 
of a particle of mass M in a potential W(R), with the boundary condition 
that x vanish at R = 0. If K is not too large, the general shape of W 
resembles that of U shown in Fig. 30. In this case, we are primarily 
interested in vibrations of small amplitude about the minimum. We can 
then expand W about its minimum at iii, which is only the same as TZo 
if K = 0, to give 

1 V(R) =W« + ^K {) (R - R x y + b(R - Ri)* + c(R, - 7?i) 4 (40.18) 

1 Unlike the true interaction, t.he Morse potential is finite at R “ 0. 
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where higher order terms are neglected. If the 6 and c terms are also 
neglected and the domain of R is extended to — , the eigenvalues of 

(40.17) are those of a linear harmonic oscillator with an additive term 
TFo. This is a good approximation for moderate values of the rotational 
quantum number K and the vibrational quantum number v. A somewhat 
better approximation can be obtained by regarding the b and c terms in 

(40.18) as perturbations on the oscillator. Since the b term produces 
only a second-order effect (see Prob. 2, Chap. VII), whereas the c term 
appears in first order (its expectation value can be computed by matrix 
methods as in Prob. 6, Chap. VI), both make contributions to E that have 
the same order of magnitude. 

Energy Levels. The eigenvalues of (40.17) to lowest order in b and c 
are then 


E — TV o ft 




* ft 2 /) 2 

(» + *)- 44 m> +i) 2 + *i 


MK% 

+ mh [{v + + i] 


v = 0,1,2, 


(40.19) 


Wo, K 0 , b, and c can all be expanded in powers of K(K + 1), where the 
coefficients depend on the parameters of the function U(R). If U has 
the form (40.16), the following expressions can be obtained: 


Ri = Ro d - 


h*K(IC + l)a 2 
2MR s 0 Uo 


w _ p| »K(K + 1) 
Fo ~ ~ Un + —2MR\ - 


K 0 = 
b = 


2 Up 3^KjKj-J) a 

a* MRla 2 R 0 
_Uo 7Jh 

a 3 ' C 12a 4 


h*K\K + l) 2 a 2 
4-M*R$U a 

l 1 ~ s) 


(40.20) 


Only enough terms have been retained to give E correctly to second order 
in (v + i) and K(K + 1 ). 

The first of Eqs. (40.20) shows that the molecule stretches owing to 
rotation. The second equation is the equilibrium energy — Z7 0 , plus the 
rotational energy to second order. The first-order rotational energy is 
h 2 K(K + l)/2Jo, where Io =» MRl is the moment of inertia of the mole¬ 
cule about an axis perpendicular to the line joining the nuclei; this energy 
is the same as for a rigid rotator (see Prob. 12). The third equation 
includes the change in the stiffness due to stretching. The corrections 
for stretching in the anharmonic terms b and c can be neglected to this 
order. The second term on the right side of (40.19) can be expanded 
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with the help of the expression for K a to give 


(2U ,Y 
\May 


(» + *) 


1 - 


Zh*K(K + 1) 


Ro (* Bo)] 


4 MRlUo 

The last two terms in (40.19) give the second-order vibrational energy 

(- H + A) (» + i) 2 - + * )2 (40,21) 


since the constant factors cancel. 

It is apparent that the rotational and vibrational energy levels agree 
in order of magnitude with the estimates made at the beginning of this 
section. As either v or K increases, the spacing between levels becomes 
smaller than that predicted from the simple rigid rotator and harmonic 
oscillator. 

Effect of Nuclear Identity. In the event that the two nuclei of a 
diatomic molecule are identical, the wave function must be symmetric 
with respect to an interchange of their space and spin coordinates if the 
nuclei have zero or integer spin, or antisymmetric if they have half-odd- 
integer spin (see Sec. 33). The discussion of Sec. 14 shows that the par¬ 
ity of the nuclear wave function is determined by the angular function 
Ykm k (6,<P) } and is even or odd according as K is even or odd. An inter¬ 
change of the space coordinates of the two nuclei is equivalent to a change 
in sign of their relative position vector R, so that the parity determines 
the space symmetry of the wave function. We thus see that for nuclei 
of zero or integer spin, the spin function must be symmetric for even K 
and antisymmetric for odd K ; for nuclei of half-odd-integer spin, the spin 
function must be antisymmetric for even K ajid symmetric for odd K. 

The discussion of Sec. 33 shows that for two nuclei of spin Ih each, 
the total of (27 + l) 2 spin states can be divided into (7 + 1)(2J + 1) 
symmetric states and 7(27 + 1) antisymmetric states. Thus in a gas 
that is in statistical equilibrium, the ratio of the number of molecules 
with even K to the number with odd K will be (7 + l)/7 if 7 is zero or an 
integer, and 7/(7 + 1) if I is half an odd integer. 1 This effect gives rise 
to alternating intensities in the band (rotational) spectrum of homonu- 
clear diatomic molecules. Both the spin and the statistics of appropriate 
nuclei can be determined in this way, and the results are in agreement 
with the general statement in Roc. 33. 

41. ATOMIC NUCLEI 

The application of quantum mechanics to the investigation of the 
structure of atomic nuclei entails great mathematical complexities in all 

1 This ratio is, of coil ran, modified by tho Boltzmann factor if the spacing between 
rotational levels is not small in comparison with the thermal energy k T. 
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but the simplest cases. In this section we review briefly present-day 
ideas of nuclear structure and the nature of the formal difficulties encoun¬ 
tered. Following this, we consider some elementary properties of the 
nuclear two-body problem. 1 

General Properties of Nuclei. Atomic nuclei are now believed to 
consist of protons and neutrons, both of which are called nucleons . Other 
particles (mesons, electrons), which probably have at least a transient 
existence within nuclei, are usually ignored in structure theories. Pro¬ 
tons are nuclei of hydrogen atoms, and neutrons are particles that have 
about the same mass, no electric charge, and the same spin and statistics 
as protons (spin Fermi-Dirac statistics). A nucleus can be character¬ 
ized by its charge Ze , where Z is an integer and e is the positive proton 
charge, and its mass M, which is measured in units of ts of the mass of 
O ia (oxygen isotope of mass number 16). M is always found to be close 
to an integer A , called the mass number. The number of neutrons in a 
nucleus is equal to A — Z ; thus the deuteron (heavy-hydrogen nucleus) 
H 2 consists of one proton and one neutron, the alpha particle (helium 
nucleus) He 4 of two protons and two neutrons, and the gold nucleus Au 1B7 
of 79 protons and 118 neutrons. 

According to the theory of relativity, the difference between the sum 
of the masses of the Z protons and A — Z neutrons in a nucleus and the 
mass M of that nucleus, multiplied by the square of the speed of light, is 
the energy evolved when the separate nucleons are brought together to 
form the nucleus. This energy is called the binding energy of the nucleus, 
and is conveniently measured in units of a million electron-volts (Mev). 
The mass defect M — A is an approximate measure of the binding energy, 2 
and the packing fraction (M — A)/A is a correspondingly approximate 
measure of the binding energy per nucleon. The packing fraction is 
found to be nearly constant for the nuclei between oxygen and mercury, 
and is larger at both ends of the periodic system. Thus the binding 
energy per nucleon has a nearly constant value for most nuclei, and is 
about 8 Mev. 

Each nucleus has a quite well-defined radius R, which ranges from 
about 3 X 10~ 13 cm for the alpha particle to about 10“ 12 cm for the heavi¬ 
est nuclei. The radius can be specified in terms of the interaction 
between the nucleus and either a neutron or a charged particle (proton, 

1 For more detailed discussions, see the papers by H. A. Bcthe, R. F. Bucher, and 
M. S. Livingston, Rev. Mod. Phys., 8, 82 (1936); 9, 69, 245 (1937); G. Gainow, “Struc¬ 
ture of Atomic Nuclei and Nuclear Transformations” (Oxford, New York, 1937); 
H. A. Bethe, “Elementary Nuclear Theory” (Wiley, New York, 1947); L. Rnsenfeld, 
“Nuclear Forces” (Interseience, New York, 1948). 

a It is only approximate, since the proton and neutron masses are somewhat greater 
than unity and not quite equal to each other. 
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alpha particle). In the first case, collision experiments indicate that there 
is a separation distance beyond which there is no interaction and within 
which a neutron is strongly attracted by the nucleus. In the case of a 
charged particle, collision experiments show a distance within which the 
long-range Coulomb repulsion is dominated by a strong, short-range, 
attractive interaction. Some of the heaviest elements emit alpha 
particles radioactively, and the nuclear radius can be determined from 
the relation between atomic number, alpha-particle energy, and radio¬ 
active decay rate. The radii obtained from considerations of all these 
kinds are in substantial agreement. The volume per nucleon turns out 
to be nearly constant for all but the lightest nuclei; this can be expressed 
by the relation R = roA*, where r 0 ^ 1.4 to 1.5 X Id -18 cm. 

Nature of Nuclear Interactions. The constancy of the binding energy 
and volume per nucleon suggests that a nucleus is constructed somewhat 
like a drop of liquid, in which the binding energy and volume per atom (or 
molecule) are also constant. The liquid structure arises from the inter¬ 
atomic potential curve (see Fig. 30, page 295); the attractive region (near 
the minimum of the potential) produces cohesion, and the repulsion at 
short distances prevents collapse. Thus the interatomic distance (and 
hence the atomic volume) is constant; the interaction is confined to near¬ 
est neighbors, so that the binding energy per atom is constant. Theo¬ 
retical considerations basep on the binding and collision of a pair of 
nucleons (see below) indicate that there is probably no appreciable 
short-range repulsion in their interaction, which resembles instead the 
square well potential of Fig. 13, page 76. The question then arises as to 
what it is that prevents the nucleus from collapsing into a region whose 
dimensions are roughly independent of the mass number, with a conse¬ 
quent increase in the binding energy per nucleon as A increases. 

It might at first be thought that the exclusion principle would prevent 
collapse, since there cannot be more than four nucleons (two protons and 
two neutrons of opposite spins) in the same space state. That this is 
not the case can be seen in the following way. The discussion of the 
Thomas-Fermi atom model in Sec. 38 shows that the maximum and aver¬ 
age kinetic energies of a number of independent particles that obey the 
exclusion principle are proportional to the two-thirds power of the spatial 
density. Thus for a region of given size, the total kinetic energy is pro¬ 
portional to AK However, the total potential energy if each particle 
can interact with every other is proportional to the number of pairs of 
particles, or approximately to A 2 . Thus the most stable state occurs 
when each particle is within interaction range of every other, and the 
total binding energy is roughly proportional to A 2 rather than to A, as 
observed. 
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It is necessary, then, to assume that the number of nucleons that can 
interact strongly with a particular one is limited in some way that is inde¬ 
pendent of the mass number. This is the kind of effect that occurs in 
polyatomic molecules that are held together by homopolar bonds. As 
discussed in the last section, these bonds are produced by resonance 
between unperturbed states that are degenerate with respect to electron 
exchange. Homopolar bonds are saturated: the exchange of a pair of 
electrons between two atoms tends to preclude the formation of an 
electron-exchange bond with a third atom. On the basis of this analogy, 
the assumption is made that the interaction between a pair of nucleons is 
also of the exchange type and shows saturation. A formal treatment can 
be made without specifying what it is that is exchanged between the 
nucleons, although it appears that mesons are responsible for the inter¬ 
action. What is required is the inclusion of an exchange operator P i2 in the 
interaction energy ( 1 , 2 ): 

WirfCIA ■ ■ ■ ) - 7(l f 2)*(2,l, . . . ) (41.1) 

The P operator can act on the space or spin coordinates of the two 
particles, or both. The combination of different kinds of exchange 
operators that should be assumed, and the extent to which they can be 
replaced by interactions that couple the spins and orbital motions of a 
pair of nucleons (tensor interactions), are not settled at the present 
time . 1 

The near equality of the numbers of protons and neutrons in the 
nuclei up to neon indicates that the interaction between a pair of protons 
is about the same as that between a pair of neutrons. As A increases 
further, the ratio of number of neutrons to number of protons increases, 
up to about 1.6 for the heaviest nuclei. This can be understood in terms 
of the repulsive Coulomb interaction between protons, which does not 
saturate, and hence adds a term of order Z(Z — 1 )e 2 /R to the total energy 
of a nucleus. This energy is relatively small in He 4 (less than 1 Mev as 
compared with a binding energy of 28 Mev), but becomes progressively 
more important as Z increases. Without this term, the most stable 
nucleus of a given mass number and radius has the same number of 
neutrons and protons, since the kinetic energy of the protons is roughly 
proportional to that of the neutrons to (A — Z) s , and the sum of 
these is a minimum when Z = $A. Thus the inclusion of a positive Z 2 
term shifts the minimum toward smaller Z. For the heaviest nuclei, 
the Coulomb term leads at first to alpha radioactivity, and eventually 

1 See for example G, Breit and E. Wigner, Phys. Rev., 53, 998 (1938); G. M. Vollcoff, 
Phya. Rev., 62, 126, 134 (1942), 
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to instability with respect to fission of the nucleus into two parts of com¬ 
parable size. 

The approximate equality of neutron-proton and like-particle inter¬ 
actions is indicated by the large binding energy of the alpha particle 
(28 Mev) in comparison with that of the deuteron (2.2 Mev). The 
alpha particle has six pairs of interacting particles, and hence about twice 
as much binding energy per pair as the deuteron. Thus the like-particle 
interactions cannot be much weaker than the neutron-proton interaction. 
Also, they cannot be much stronger or the di-proton (He 2 ) would be 
stable, and it does not appear to exist. This qualitative conclusion is con¬ 
firmed by the more quantitative treatment of the two-nucleon problem 
that is outlined below. 

Nuclear Models. The mathematical difficulties that appear in the 
treatment of all but the two-nucleon systems are due to the lack of a sim¬ 
ple model for the nucleus that preserves its essential features. As pointed 
out in Sec. 38, the central-field approximation is useful in treating many- 
electron atoms because of the dominant role of the nuclear Coulomb 
field. In the last section, it was shown that the disparity in mass between 
electrons and nuclei makes possible an effective treatment of molecules. 
With nuclei, we have a situation in which the masses and interaction 
energies of all nucleons are about the same. Moreover, the interactions 
are strong and of short range, which means that a nucleon is more strongly 
influenced by its neighbors of the moment than it is by the average 
motion of all the other nucleons. In spite of this, some work has been 
based on a central field approximation, and the Hartree method (see Sec. 
38) has been applied. 

The most realistic model, at least in the case of the heavier nuclei, 
is Bohr’s liquid-drop model. 1 This exploits the analogy between nuclear 
matter and liquids that was mentioned earlier in this section. The 
liquid-drop model was first devised to explain the large ratio of radiative 
capture cross section to scattering cross section found in the collisions of 
slow neutrons with medium and heavy nuclei. If the nucleus acts simply 
as a potential well for the incident neutron, as in the central-field approxi¬ 
mation, application of the results of Secs. 19 and 36 shows that capture 
accompanied by the emission of electromagnetic radiation (gamma ray) 
is much less probable than scattering. Bohr showed that the experi¬ 
mental results can be understood if the incident neutron is assumed to 
amalgamate with the nucleus and distribute its energy among many of 
the nuclear particles. It then takes so long for enough energy to con¬ 
centrate on a neutron to permit it to escape (this would be observed as 
neutron scattering), that the excited nucleus is likely to radiate away 

1 N. Bohr, Nature , 137, 344 (1936). 
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its excess energy before that happens. The liquid-drop model has since 
been used to gain a semiquantitative insight into other nuclear properties. 

Neutron-Proton System, The most fundamental problem to be 
solved in connection with nuclei is the determination of the parameters 
of the interaction energy between pairs of nucleons. This adds a degree 
of complication to the theoretical study of nuclei as compared with the 
study of atoms and molecules, since in the latter cases the main interac¬ 
tion is known to be given by Coulomb’s law. However, qualitative 
considerations of the kind presented earlier in this section, together with 
extensive calculations and comparisons with experiment for the systems 
that contain two, three, and four nucleons, show that the basic interaction 
is strong and of short range . 1 The precise shape of the potential does not 
seem to make much difference, and we shall assume the square well form 
(see Fig. 13, page 76): F(r) = — 7 0 forr < a, V(r) = Oforr > a. When 
suitable exchange operators and tensor factors are included, this form is 
adequate for all situations except perhaps high-energy neutron-proton 
scattering. In this case, the existence of an additional weaker inter¬ 
action of longer range could have an appreciable effect (see the discussion 
of phase shifts in Sec. 19). 

It was shown in Sec. 15 that there is no bound state of a particle of mass 
m in this potential unless V o a 2 > 7r 2 ft 2 /8m. This result can be applied at 
once to the deuteron by taking m to be the reduced mass of the neutron- 
proton system, which is very nearly half the proton mass: 

- 5 — = 1.04 X 10“ 24 Mev-em 2 

om 

If we assume that a = 2.80 X 10 ” 13 cm, Vo must exceed 13.3 Mev in 
order for the deuteron to exist. Since the deuteron appears to have only 
one bound state, we assume that it corresponds to l = 0. Then the 
solution of Prob. 7, Chap. IV, shows that the measured binding energy of 
2.18 Mev is obtained with Vo = 21.4 Mev. 

The scattering cross section for neutrons of very low energy on pro¬ 
tons can then be obtained from Eq. (19.28). If we neglect E in compari¬ 
son with 7 0 we find that <r = 4.3 X 10 ~ 24 cm 2 . The measured cross sec¬ 
tion for neutrons of a few electron-volts energy in hydrogen is about 
21 X 10 " 24 cm 2 ; this energy is small enough so that it can be neglected in 
comparison with Vo, and large enough so that the binding of the proton 
in the hydrogen molecule does not affect the result. An explanation of 

1 Specific arguments against the existence of a short-range repulsion, strong enough 
to account for the observed density of heavy nuclei without the assumption of an 
exchange interaction, have recently been given by G. Parson and L. I. Sehiff, rhya. 
Rev., 74, 1664 (1948). 
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this discrepancy in terms of the dependence of the neutron-proton inter¬ 
action on the spin state was suggested in 1935 by E. Wigner (unpub¬ 
lished). The deuteron is known to have spin A, and so is in a triplet spin 
state. However, as discussed in Sec. 34 in connection with exchange 
collisions of electrons with hydrogen atoms, the colliding neutron and 
proton will be in a triplet state in three-fourths of the collisions, and in a 
singlet spin state in one-fourth of the collisions. Thus the discrepancy 
is removed if the singlet cross section is taken to be 71 X 10“ 24 cm 2 . 

If we assume that a is also equal to 2.80 X 10 -13 cm for the singlet 
interaction, Eq. (19.28) shows that this cross section is obtained for a 
potential depth of either 12.2 or 14.7 Mev. It is apparent that this is 
the resonance scattering case discussed in Sec. 19, and that these two 
potentials correspond to virtual and bound singlet states, respectively. 
The decision as to which is correct cannot be made on the basis of the 
dependence of the scattering cross section on incident neutron energy; it 
is shown in Sec. 19 that with l = 0, <r is a monotonic decreasing function 
of E in both cases, and there is not enough difference in the behavior of 
the two functions. It is found from other considerations 1 that the singlet 
state is virtual, so that the depth corresponding to this range is 12.2 Mev. 

Exchange Operators. The spin dependence of the neutron-proton 
interaction found in connection with low-energy scattering can be 
expressed in terms of a spin-exchange operator [see Eq. (41.1)]. Such an 
operator multiplies a triplet (symmetric) spin function by +1 and a 
singlet (antisymmetric) spin function by — 1. It follows from the fore¬ 
going discussion that about 20 per cent of the total triplet interaction is of 
the spin-exchange type. 

A space-exchange operator 2 multiplies wave functions of even l by +1 
and wave functions of odd l by — 1 ; it has no effect on the results obtained 
thus far, which are all for l = 0 . For higher energy scattering, the dis¬ 
cussion of See. 19 shows that the partial wave with l = 1 can be sig¬ 
nificant. If the phase shift 5i is small in magnitude and the higher phase 
shifts can be neglected, Eq. (19.32) can be approximated as 

<r(0) = ~ (sin 2 5 ( > + 35 1 sin 25 n cos 8) 

For energies high enough so that 5i is appreciable, 5 0 is likely to be 
between 0 and 90°, so that the sign of the angular asymmetry is deter¬ 
mined by the sign of 8\. 

1 Most reliably from the scattering of very slow neutrons in ortho- and para- 
hydrogen; seo J. Schwinger and K. Teller, Phyn. ltcv., 52, 280 (1037). 

2 This was introduced by ft. Majorana, Zaita.J. Phyaih , 82, 137 (1933), as a modifi¬ 
cation of the original suggestion of W. Heisenberg, Zrila.f. Phynik , 77, 1 (1932). 
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If now the interaction is predominantly of the space-exchange type, 
the potential energy is positive (repulsive) for l = 1 , and 5i is negative. 
Then neutrons incident on protons are preferentially scattered backward 
in the center-of-mass coordinate system, or at right angles in the labora¬ 
tory system. The protons tend to recoil in the forward direction in both 
systems. This effect can be regarded physically as a normal (non¬ 
exchange) scattering, in which the neutrons are preferentially scattered 
in the forward direction, together with an exchange of identity between 
the neutron and the proton. 

Proton-proton Scattering. The scattering of fast protons in hydrogen 
can be treated by the methods outlined in Sec. 20 , with suitable allow¬ 
ance for the identity and spin of the two colliding protons (see Sec. 33). 
When the interaction is a pure Coulomb field, combination of Eqs. 
{20.10) and (33.2) yields the Mott scattering formula 1 

C t ( 0 ) = I cosec4 9 + se ° 4 * 9 

— cosec 2 id sec 2 id cos In (tan 2 id) j J (41.2) 

which is expressed in the center-of-mass coordinate system. This for¬ 
mula represents only the experiments for protons of less than about 0.2 
Mev bombarding energy, since for higher energies the protons approach 
closely enough so that the specifically nuclear interaction becomes 
appreciable. Up to several Mev energy, only the 5 n term in (20.24) need 
■be included. It must be remembered that because of the exclusion 
principle, the partial wave with l = 0 is associated with a singlet spin 
state. Thus experiments with fast protons are required if information 
concerning the triplet proton-proton interaction is to be obtained. The 
indication at present is that the singlet proton-proton interaction agrees 
in range and sign with the singlet neutron-proton interaction discussed 
earlier in this section, and is slightly smaller in magnitude . 2 

Problems 

1. Find an expression for the electron density n(r) in the Thomas-Fermi model 
in terms of the dimensionless function x, and show that the radius of a sphere that 
encloses a fixed fraction of all the electrons is proportional to Z~K 

2. Use Lagrange’s method of undetermined multipliers to show that the condition 
that the integral in (38.9) is stationary, when the u 's are varied but kept normalized, 
is given by (38.11). 

3. Two p electrons (Z - 1) can have L = 0, 1, or 2 and S « 0 or 1, in the RusselL- 
Saunders case. Are all combinations of L and S permitted if the n values of the two 
electrons are different? Are they all permitted if the n values are the same? 

1 N. F. Mott, Proc. Roy. Soc ., A126, 259 (1930). m is half the proton mass, and 
v is the relative speed. 

4 G. Breit, H. M. Thaxton, and L. Eisenbud, Phys . Rev 55, 1018 (1939). 
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4. Show that spin wave functions in the Eussell-Saunders case that are eigen¬ 
functions of S a with different values of the total-Bpin quantum number S are orthog¬ 
onal. Use a method like that which was employed in Sec. 10 to show that energy 
eigenfunctions corresponding to different eigenvalues are orthogonal. 

5. Show by direct computation that J => L + S commutes with L*S, and hence 
with the Hamiltonian (39.1). 

6. Use the M matrices given in Eqs. (24.16) to show that the wave functions 
(39.6) have the indicated eigenvalues of J 2 and J g . 

7- Estimate the ratio of the term in (39.9) that is linear in H to the kinetic-energy 
term, for a hydrogen atom in a magnetic field of 10 6 gauss. 

8. Estimate the magnitude of the magnetic field strength for which the two 
perturbation terms in (39.11) are equal for an alkali atom. 

9. Construct a diagram that shows the relative displacements and intensities of 
the % and <r Zeeman components of the a P —> *S transitions in an alkali atom when the 
magnetic field is weak. Construct a similar diagram when the field is strong. 

10. Show that the ratio of the distance traveled by the nuclei of a molecule during 
a period of the electronic motion, to the dimensions of the molecule, is of order (m/Mft 
in the case of vibrational motion, and (m/M) in the case of rotational motion. Do 
these results justify an adiabatic type of approximation? 

11 . Does the internuclear potential (40.14) obtained for the hydrogen molecule 
approach the — 1/E 6 form obtained in Sec. 27 for large if? If not, why doesn't it? 

12. Set up and solve the wave equation for a rigid rotator: an object that has no 
kinetic energy of rotation about a particular axis, and equal moments of inertia about 
the two perpendicular axes. 

13. Derive the selection ruleB for transitions between rotational levels of a diatomic 
molecule. 

14. Discuss the selection rules for vibrational transitions when the two nuclei of a 
diatomic molecule are the same and when they are different. 

16. Calculate the contributions to the vibrational energy of a diatomic molecule of 
the neglected fifth- and sixth-power terms in the expansion (40.18), and show that 
their neglect is justified in arriving at the (v -f- i) 2 energy given in (40.21), Use matrix 
methods to get. the needed matrix element* of x h and x a for a harmonic oscillator. 

16. Calculate the minimum kinetic, energy in Mov of a nucleon that moves in a 
rigid-walled sphere whose radius It corresponds to a uranium nuclous. 

17. Suppose that the interaction between a neutron and a proton is the same in 
the singlet as in the triplet state, and is represented by a square well. Is there any 
value of a that will lit both the deuteron binding energy (l « 0) and the slow-neut.ron 
scattering cross section? If so, what is it? 

18. Make use of the work of Bee. 20 to obtain an expression for the ratio of the 
proton-proton scattering with a phase shift 5 0 (and no others), to tho Mott scattering 
given by Eq. (41.2). 

19. The ground-state wave function V'o of an atom or nucleus, if nondegencra.te, 
has a definite value Jo for its total-angular-momeiitum quantum number. Since the 
diagonal matrix element of an operator il for a state ^ vanishes unless is spheri¬ 
cally symmetric, show that the expectation value of £2 for the state i/'o is zero unless 
the function 12 ^ 0 , when expanded as a series of eigenfunctions of the operator J 8 , 
contains a term with quantum number Jo. If £2 is the operator for a 2 l electric or 
magnetic multipole moment, show also that its expectation value is zero unless Jo £ kl 
(compare with the discussion at the end of See. 26). 



CHAPTER XII 

RELATIVISTIC WAVE EQUATIONS 


In this chapter we extend the nonrelativistic Schrodinger wave 
equation to the description of the motion of a particle that has a speed 
approaching that of light. This extension can be made in many ways, 
each of which is consistent with the Lorentz transformation equations of 
the special theory of relativity. 1 A characteristic feature of relativistic 
wave equations is that the spin of the particle is built into the theory from 
the beginning, and cannot be added afterward as Pauli added the electron 
spin to Schrodinger^ nonrelativistic equation. This feature provides a 
useful gauge of the applicability of a particular equation to the description 
of a particular kind of particle. Two relativistic equations are con¬ 
sidered here: the scalar equation due to Schrodinger that describes a 
particle without spin, and the more complicated equation due to Dirac 
that describes an electron. In discussing these equations, we devote 
our attention mainly to the deductions that can be made from them and 
do not attempt to establish their Lorentz invariance. We shall, there¬ 
fore, continue to use three-dimensional vector notation rather than the 
more elegant four-dimensional notation of special relativity theory. 
The invariance of an equation can usually be inferred quite convincingly 
from its symmetry between the space coordinates and the time. 

42. SCHRGDINGER’S RELATIVISTIC EQUATION 
At the time when Schrodinger developed his nonrelativisitic wave 
equation, he also proposed an extension of it that meets the requirements 
of special relativity. 2 This equation follows quite naturally from the 
transition in classical dynamics from the nonrelativistic relation 



(42.1) 


between the energy and momentum of a free particle, to the corresponding 
relativistic relation 


E 2 = csp 2 + m 2 c A 


(42.2) 


1 For a review of special relativity, see, for example, P. G. Bergmann, “Introduc¬ 
tion to the Theory of Relativity,” Part I (Prentice-Hall, New York, 1046), or H. C. 
Tolman, “Relativity, Thermodynamics and Cosmology,” Chaps. II, III, and IV 
(Oxford, New York, 1934). 

2 E. Schrodinger, Ann. d. Phyaik, 81, 109 (1920), Sec. 6. 
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where now E includes the rest-mass energy me 1 . We proceed by adopting 
the substitutions (6.13) for E and p 

E ih p —* —ih grad (42.3) 

Free Particle. A relativistic wave equation for a free particle can be 
obtained by substituting (42.3) into (42.2) and operating on a wave 
function just as the substitution of (42.3) into (42.1) yields Eq. 

(6.11). The result is 

- W ^ = -JiWV + m 2 c^ (42.4) 

Equation (42.4) has plane wave solutions of the form 

exp i(k • r — c ot) (42.5) 

which are eigenfunctions of the operators E and p in (42.3) with eigen¬ 
values hu and ftk, respectively. It is apparent that (42.5) satisfies Eq 
(42.4) if 

hu = + (ftVk 2 + ra 2 e 4 )* (42.6) 

The positive and negative square roots in (42.6) correspond to an ambigu¬ 
ity in the sign of the energy that also results from the classical expression 
(42.2). We take only the positive square root for the present, and 
return to the negative-energy solutions at the end of Sec. 44. 

Expressions for the charge and current densities can be found in 
analogy with those obtained in Sec. 7. The conservation equation 

+ div S(r,<) = 0 (42.7) 


turns out to be invariant with respect to Lorentz transformations. We 
multiply (42.4) on the left by the complex conjugate equation on the 
left by yp, and subtract one from the other. Then (42.7) results, if we 
define real quantities 


= — ($— - ) 

{ , ) 2 mc*y dl V dl,J 

S(r,0 = ~ ($ grad $ grad if) 


(42.8) 


This expression for S is identical with the nonrelativistic form (7.3), 
and the expression for P can be shown to reduce to (7.1) in the non¬ 
relativistic limit (see Prob. 2). It should be noted that P given by (42.8) 
is not necessarily positive, and hence cannot be interpreted as a position 
probability density. It can, however, be multiplied by e and interpreted 
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as an electric charge density, since charge density can have either sign so 
long as it is real. 

Electromagnetic Potentials, We can include the electromagnetic 
potentials A(r y t), in the wave equation by making use of the fact 

that 4> and (l/c)A have the same Lorentz-transformation properties as 
E and p. In analogy with the nonrelativistic expression (23.14), we 
replace (42.2) by 

(E - e<£) 2 = (cp - eA) 2 + mV (42.9) 

for a particle of charge e. The substitutions (42.3) then give 

(-*’!?- “4 %+*•)* 

= [ — h 2 ^ 2 + 2iehcA • grad + iehc (div A) + e 2 A 2 + mV] 4* (42.10) 


We can now find the connection between Eq. (42.10) and the similar 
Eq. (23.24) in the nonrelativistic limit. We make the substitution 

imoH 

*(r ,0 = ^(r fie" * (42.11) 

in (42.10), and assume that operation with ih(d/dt) on p r gives a result 
that is of the same order as eW and small in comparison with mV'. 
This is equivalent to subtracting out the rest energy and assuming that 
the remaining energies are small in comparison with it. Differentiation 
of (42.11) gives 

dt \dt h*J 

av /ay 2imc 2 di' m 2 c 4 \ 

df 2 \ dt 2 h dt h 2 * ) e 

The first term in each of these derivatives can be neglected, as can the last 
two terms on the left side of (42.10), which then becomes 


^ 2ihmc 2 ^ + m 2 cV' “ 2c h 

With these approximations, Eq. (42.10) becomes the same as (23.24) 
if yp' is replaced by 

There is no way in which the Pauli spin matrices (33.3) can be included 
in Eq. (42.10) without destroying the invariance of the theory. This is 
not surprising, since the spin matrices transform like the components of a 
three-dimensional, rather than a four-dimensional, vector, and since \f/ 
has one component rather than two components like the spin functions 
(33.4). Thus the Schrodinger relativistic equation represents a particle 
that has no spin. 
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The structure of Eq. (42.9) shows that a “potential-energy” term 
cannot be added arbitrarily to (42.10), as the term V\j/ was added to 
(23.24) to give Eq. (35.1). The Lorentz-transformation properties of 
any such term must be investigated first. If it transforms like part of a 
four-dimensional vector, the rest of this vector must be included in some 
such manner as <j> and (l/c)A were included in (42.9). If it is an invariant 
with respect to Lorentz transformations, it can be included as part of the 
rest energy me 2 . 

Separation of the Equation. The wave equation (42.10) can be 
separated with respect to r and t, if the potentials A ,<#> are independent of 
the time. We then put 

M*it) = u(x)e » 

and substitute into (42.10) to obtain 
C E - e<t>) 2 u 

= [—ft 2 c 2 V 2 + 2iehcA • grad + iehc (div A) + e 2 A 2 + m 2 e 4 ]w (42.12) 

We now specialize to the case in which A = 0 and </>(r) is spherically 
symmetric. Equation (42.12) then becomes 

(— ft 2 c 2 V 2 + m 2 c 4 )u(r) = [E - ecj>(r)]*u( r) (42.13) 

which can be separated in spherical coordinates (see Sec. 14) to give 

u(r,0,<l>) = /i(r)y Zm (0,<£) 


l = 0 , 1 , 2 , . . . 

This reduces to the nonrelativistic radial equation if we put E = me 2 + E r y 
and assume that E' and e<j) can be neglected in comparison with me 2 . 
Then the bracket on the right side of (42.14) becomes (2 m/h 2 ) (. E f — e<£), 
as it should. 

Energy Levels in a Coulomb Field. An exact solution of (42.14) can 
easily be obtained if we put c4> = — Zc 2 /r , by making use of the results of 
Sec. 16. This situation would represent a hydrogen atom were it not for 
the fact that the particle described by (42.10) has no spin, and so cannot 
be an electron. 

If we put p = ar , Eq. (42.14) can be written as 


{E — c<£) 2 ■ 
hV~ 


(42.14) 


r i d( d\ iq+_i) l R = \ 

r 2 dr V dr) ^ r l J 


1 d ( 2 dR\ , A 

«^ + \p 

Ze* 



4 (m a c* — .E 2 ) = 2Ey 

A 2 c 2 ’ hca 


a 


(42.15) 
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This has precisely the form of Eq. (16.7) except that 1(1 + 1) has been 
replaced by 1(1 + 1) — 7 2 * The parameter X is determined by the bound¬ 
ary condition on R when p = and E is expressed in terms of X by 
eliminating a from the last two of Eqs. (42.15): 

E = me 2 ^1 

A study of the way in which Eq. (16.7) was solved shows that solu¬ 
tions of (42.15) that are finite at p = 0 and oo exist only if 

X = n r + 5 + 1 (42.17) 

where n' is zero or a positive integer and s is the nonnegative solution of 
the equation 

s(s + 1) = 1(1 + 1 ) - 7 2 (42.18) 

Equation (42.18) has the two solutions 

s = - 1 ± i[(2 1 + l) 2 - 4 7 2 ]* (42.19) 

of which one is positive and the other negative for l > 0. For Z = 0, 
both the s values given by (42.19) are negative; however, 7 is quite small 
(very nearly equal to Z/ 137 if e is the electronic charge), so that use of 
the upper sign in (42.19) gives a value of s that is close to zero for physi¬ 
cally interesting values of Z. Moreover, even though R(r ) behaves like 
r 8 near r = 0 , and so is singular at the origin, the integral of P(r) given 
by (42.8) converges there so that the total electric charge is finite. We 
thus use the upper sign in (42.19) for all Z, and obtain from (42.17) 

X = n' + i + [(! + i) 2 - 7 2 ]* (42.20) 

Equations (42.16) and (42.20) give a fine structure to the nonrela- 
tivistic energy levels (16.15). This can be seen by expanding the expres¬ 
sion for the energy levels in powers of 7 2 . The result to terms of order 
7 4 is 

E - ™-[i(rq^-i)] <«•*> 

where n = n' + l + 1 is the total quantum number of Eq. (16.14), and 
can take on positive integer values. The first term on the right side of 
(42.21) is the rest energy. The second term is 

me 2 7 2 _ mZ 2 e 4 
~~2n*~ ~ “ MW’ 


+ $)~* (42.16) 


and agrees with (16.15). The third term is the fine-structure energy, 
which removes the degeneracy between states of the same n and different- 
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l. The total spread of the fine-structure levels for a given n is easily 
seen from (42.21) to be 


me 2 7 4 n — 1 
n 8 u — ^ 


(42.22) 


This is much larger than is observed experimentally in the spectrum of 
hydrogen. 


43. DIRAC’S RELATIVISTIC EQUATION 
Dirac 1 approached the problem of finding a relativistic wave equation 
by starting from the Hamiltonian form (23.1) 

ih | *(r,Q = (43.1) 

The classical relativistic Hamiltonian for a free particle is the positive 
square root of the right side of Eq. (42.2). However, if this is substituted 
into (43.1) and p is replaced by — ih grad, the resulting wave equation is 
unsymmetrical with respect to space and time derivatives, and hence not 
relativistic. Dirac therefore modified the Hamiltonian in such a way 
as to make it linear in the space derivatives. 

Free-particle Equation. The simplest Hamiltonian that is linear in 
the momentum and mass terms is 

II = — ca ■ p — fime 2 (43.2) 

Substitution into (43.1) leads to the wave equation 

(E + cot * p + Pmc 2 )\l/ = 0 

or (43.3) 

^ih ^ — ihcJot ■ grad + / 8mc 2 ^ = 0 

We now consider the four quantities a x , a V) a z , and /?. If (43.3) is to 
describe a free particle, there can he no terms in the Hamiltonian that 
depend on the space coordinates or the time, since such terms would 
have the properties of spacc-time-dependcnt energies and give rise to 
forces. Also, the space and time derivatives are to appear only in p and 
E , and not in a and /3, since (43.3) is to be linear in all these derivatives. 
We thus conclude that a and 0 are independent of r, t , p, and E } and hence 
commute with all of them. This docs not necessarily moan that ct and /3 
are numbers, since they need not commute with each other. 

1 P. A. M. Dirac, Proc. Roy. Soc. } A117, 610 (1028); “The Principles of Quantum 
Mechanics/' 2d ed., Chap. XII (Oxford, New York, 1035). 
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We can learn more about a and 0 by requiring that any solution ^ of 
(43.3) also be a solution of Schrhdinger's relativistic equation (42.4) 
(the converse need not be true). This is a reasonable requirement since 
in the absence of external fields, the wave-packet solutions of (43.3) 
whose motions resemble those of classical particles must have the classical 
relation (42.2) between energy, momentum, and mass (see Prob. 1), 
We therefore multiply Eq. (43.3) on the left by (E — ca * p — 0mc 2 ) to 
obtain 

{E 2 - C 2 [c%pl + C%pl + 0?»v\ + (obOty + CLyOLi)VxVv 

+ ( otyOt, + oLtO^PyPt + (a 2 a x + a x a 2 )p t p x \ — m 2 c 4 0 2 

7YIC 2 [(cLj$ “1“ fa x )p a (<*i/0 “f" fi°hi)Pv H” “f” 0^*)?^]}^ = 0 (43.4) 

where the substitutions (42.3) for E and p in terms of differential opera¬ 
tors are implied. Equation (43.4) agrees with (42.4) if «,0 satisfy the 
relations 

a\ = a 2 = a 2 = 0 2 = 1 

a x a v + a v&x = CLy(x z + a t oL v = a z oi x + a x a z '*= 0 ( 43 . 5 ) 

a xfi + fax = OLyfi + Pdy = CL$ + fa 2 = 0 

The four quantities are said to anticommute in pairs, and their squares are 
unity. 

Since a, 0 anticommute rather than commute with each oilier, they 
cannot be numbers. We have seen in Chap. VI that quantities of this 
type can be expressed in terms of matrices, and it is convenient for calcula¬ 
tion to have a matrix representation of them. We note first, that since 
the H given by (43.2) is Hermitian, each of the four matrices a,0 must 
be Hermitian, and hence square. Our problem is to find an explicit repre¬ 
sentation in which, say, one of these matrices is diagonal (then the others 
cannot be diagonal since they do not commute with this one). In the 
interests of simplicity, we shall require the representation to have as low a 
rank as possible. 

Matrices for a and 0. The squares of all four matrices are unity, so 
that their eigenvalues are +1 and -1, We arbitrarily choose 0 as the 
matnx that is to be diagonal, and rearrange its rows and columns so that 
all the +1 eigenvalues are grouped together in a matrix of rank n, and all 
the 1 eigenvalues are grouped in a matrix of rank m . Since 0 anti¬ 
commutes with a, it cannot be a constant matrix, and so both n and m 
mus e greater than zero. The 0 matrix can be represented schemati- 

An I Itt nn L 
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(43.7) 


The dashed lines in (43.7) separate the submatrices 1, 0, 0, and —1 that 
appear in (43.6). 1 We now consider the matrix equation a, a/3 + 0a* = 0, 
the jl element of which is 

(a»)ji(& + ft) = 0 


Here ft and ft are two of the eigenvalues of ft which are arranged in 
accordance with (43.6) or (43.7). If ft = ft, then (o^/i = 0, whereas if 
ft and ft have opposite signs, {a x )n need not be zero. Thus the matrix for 
a x can be written in the form 



where a x i has n rows and m columns, and a x % has m rows and n columns. 
Since the square of (43.8) is a unit matrix, we see also that 

OL x iOi X 2 = 1, Ois2C6 x \ = 1 (43.9) 

The unit matrix that appears on the right side of the first of Eqs. (43.9) 
has n rows and columns, and the unit matrix in the second equation has 
m rows and columns. It is not difficult to show that no matrices can 
be found that satisfy (43.9) if n } m equal 1,2 or 2,1. We therefore con¬ 
sider the two possibilities n = m = 1 and n = m = 2, as giving matrices 
of minimum rank. It is apparent that a y and a t can be written in forms 
similar to (43.8). 

We have already obtained three anticommuting matrices of the form 
(43.6) or (43.8) with n — m = 1. These are the Pauli spin matrices 
(33.3) ’ ' 

ff * = (i o) = (*• -;} ff ‘ = (o -i) (43 - io) 

which satisfy the equations 

O xCT y = == t(Tg (43.11) 


1 The matrices 1 , 0 are the same as the unit and null matrices 1 , O defined in 
Sec. 21. 
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together with the two similar relations obtained by permuting x, y, z. Any 
matrix with two rows and columns has four elements, and so can be 
expressed as a linear combination of the four linearly independent 
matrices o- a , and 1. Then it is not difficult to show that a fourth 
matrix that anticommutes with all three of those in (43.10) cannot be 
found. 

We therefore try n = m = 2, and for simplicity take a m i = 
etc. Then Eqs. (43.9) become aJi = 1, and the equation a x a y + — 0 

becomes oxio^i + ayia x i = 0. From these and the similar relations 
obtained by permuting we see at once that we can identify oc x i with 
o-®, etc. We thus arrive at an explicit matrix representation for : 



(43.12) 


These matrices are evidently Hermitian; we abbreviate them as 


»-(j _;) «-(; s) <«.i3) 

where each “element” is a matrix with two rows and columns. 1 

Free-paxticle Solutions. Now that a and /S are represented by 
matrices, Eq. (43.3) has no meaning unless the wave function ^ is itself 
a matrix with four rows and one column: 


»Kr ,t) = 


I ^s(r.O | 
\ V's(r.f) I 


(43.14) 


Then (43.3) is equivalent to four simultaneous first-order partial dif¬ 
ferential equations that are linear and homogeneous in the four ^’s. 
Plane wave solutions of the form 


= Uj exp i(k • r - ut), j = 1,2,3,4 (43.15) 

1 The relativistic character of the Dirac equation can be made more apparent if 
(43.3) is multiplied through on the left by /3; this enhances the symmetry between 
space and time derivatives, since the four matrices (3 , /3a have similar properties to the 
four matricea /3, a. 
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can now be found, where the Uj are numbers. These are eigenfunctions 
of the energy and momentum operators (42,3) with eigenvalues ftco and 
ftk, respectively. Substitution of (43.15) and (43.12) into (43.3) gives 
a set of algebraic equations for the Uj, where E = hca and p =» ftk are now 
numbers, 

(E + mc 2 )ui + cpzUt + c(p x — ip v )uA = 0 
( E + mc 2 )u 2 + c{p x + ip v )u 2 — cp z ut = 0 , 3 . 

(E — mc 2 )us + cp z Ui + c(p x — ip y )u 2 = 0 ^ ‘ 

(E — mc 2 )u\ + c(Px + ipy)ui — cp z u 2 = 0 


These equations are homogeneous in the Uj, and have solutions only if 
the determinant of the coefficients is zero. This determinant is 


( E 2 - m 2 e 4 - c 2 p 2 ) 2 , 


so that the relation between E and p is in agreement with (42.2). 

Explicit solutions can be obtained for any momentum p by choosing a 
sign for the energy, say E+ = + (c 2 p 2 + m 2 c 4 )*. Then there are two 
linearly independent solutions, which are conveniently written as 


Oil 


cp* 

E+ + me 2 ’ 



u 2 — 


c(px + ipy ) 
E + + me 2 


) 


u 2 


Cp s 

E+ + me 2 ’ 


Uz = 1 , ^4 = 0 

(43.17) 

Us — 0 , — 1 


Similarly, if we choose the negative square root E-. = — (c®p 2 +■ ra 2 c 4 )*, 
we obtain two new solutions, which are conveniently written as 


u i = 1, u 2 = 0, 


_ cp z 

U 3 = r n — . - 

— + me 1 


«i = o, u* = 1, 


U 3 


c(Pt - m) 
-E- + me 1 ’ 




cjpx ± i'Pv) 

— + me 2 

(43.18) 


U\ — 


CPz 

—+ me 2 


Each of these four solutions can be normalized, in the sense that ^ V = 1, 
by multiplying it by the factor jl + [c^p'VC^H- + me 2 ) 2 ])"*; is the 
Hermitian adjoint of and is a matrix with one row and four columns. 

It is apparent that the solutions (43.17) correspond to positive energy, 
and the solutions (43.18) to negative energy. In the nonrelativistic 
limit, in which E + = — is close to me 2 and large in comparison with 
c|p|, u\ and u 2 are of order v/c times u 3 or U\ for the positive-energy solu¬ 
tions (v is the speed of the particle); the opposite is true for the negative- 
energy solutions. The physical distinction between the two solutions 
for each sign of the energy can be seen by defining three new spin matrices 
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</ m </„, a, that have four rows and columns 

°) < 4s i9 > 

We shall see at the beginning of Sec. 44 that %hd' can be interpreted as 
the operator that represents spin angular momentum. When the small 
components of the wave function can be neglected, it is easy to see that 
^ is an eigenfunction of with eigenvalue +1 for the first of each pair of 
solutions (43.17) and (43.18), and eigenvalue —1 for the second solution 
of each pair. 

Charge and Current Densities. We can obtain a conservation equa¬ 
tion by multiplying the second of Eqs. (43.3) on the left by the Iier- 
mitian adjoint equation 


—ih + t7lc(grad^*) • « + ^*/3mc 2 = 0 
at 


on the right by and taking the difference of the two results, 
get Eq. (42.7) if we define the real quantities 

P( r,0 = 

S(r J) = — c4t*wp 


We then 


(43.20) 


The expression for P has the nonrelativistic form (7.1); since it is never 
negative, it can be interpreted as a position probability density. It can 
be shown that the expression (43.20) for S reduces to (7.3) in the non¬ 
relativistic limit (see Prob. 6). 

The operator — ca can be interpreted more directly as a particle 
velocity by calculating the time derivative of the position vector r from 
Eq. (23.2), With the Hamiltonian (43.2) and the commutation relations 
(23.16), we obtain 

% - a <* H - < 43 - 21) 


Thus the eigenvalues of a velocity component are ±c. This result can 
be made plausible from a physical point of view with the help of the 
uncertainty relation (3.1). A very precise measurement of instantaneous 
velocity [which according to (43.21) is not the same as momentum in a 
relativistic theory] requires the accurate measurement of the position of 
the particle at two slightly different times. Such accurate position 
measurements imply that the momentum of the particle is completely 
unknown, so that all momentum values are about equally probable. 
Then very large momenta are much more likely to result than small 
momenta, and these correspond to velocity components close to the 
speed of light. 
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Electromagnetic Potentials- Terms that involve the electromagnetic 
potentials can be added to Eq. (43.3) in a relativistic way by making the 
usual replacements cp cp — eA and E —> E — e0, where the particle 
described by the equation has electric charge e. We thus obtain 

[E — e<j) + « • (cp — eA) + jS mc 2 \p =» 0 (43.22) 

Here E and p stand for the operators (42.3). This equation can be 
reduced to a form that is similar to (42.10) by multiplying it on the left 
with [E — e<j> — a • (cp — eA) — j3rac 2 ]. The result is 

{(E — e(j>) 2 — [a • (cp — eA)] 2 m 2 c 4 

+ {E — e<f>)o t • (cp — eA) — a • (cp — eA )(E — e<j!>)}^ « 0 (43.23) 

The second operator in (43.23) can be reduced by making use of the 
following relation; 

(a • B)(a • C) = B • C + id' • (B X C) (43.24) 

where B and C commute with a but not necessarily with each other (see 
Prob. 7). We identify both B and C with (cp — eA); we also require the 
relation 

(cp — eA) X (cp — eA) = —ce(A X p + p X A) » iehccuil A = iehcH 

where use has been made of (23.15). With this substitution, Eq. 
(43.24) becomes 

[a * (cp — eA)] 2 = (cp — eA) 2 — ehcd f ■ H 


The last two operators in (43.23) can be rewritten as 


— eoL • (EA — A E) — cea • (</>p — p <f>) 

= — ieha • 


dA 

et 


— ichca * grad </> = iehca. ■ E 


where use has again been made of (23.15). Equation (43.23) then 
becomes 

[(E — e<f>) 2 — (cp — eA) 2 — m 2 c 4 + ehcd' ■ H + iehca • E]^ = 0 (43.25) 

The first three terms are precisely the same as (42.10). The physical 
significance of the last two terms will now be shown from consideration of 
the nonrelativistic limit of the entire equation. 

We could proceed just as we did in obtaining the nonrelativistic limit 
of Eq. (42.10). A slightly different approach consists in putting 


E — E* -(- ? tic 2 


(43.26) 
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and assuming that E' and e are small in comparison with me 2 ) this is 
equivalent to the substitution (42.11) and the subsequent approxima¬ 
tions. We can then make the replacement 

(E — etpy — m 2 c 4 = 2 mc 2 (E f — e 0) 
in (43.25) to obtain 



Now E r is equivalent to the time derivative operator ih(d/dt) if a factor 

imoH 

e h is taken out of Thus (43.27) is the nonrelativistic Schrodinger 
equation (23.24), with two additional terms that involve H and E directly. 

The H term has the form associated with the energy of a magnetic 
dipole of moment (eh/2mc)6\ Now it was shown in (43.18), for a free 
particle, that the third and fourth components of the wave function are 
large in’comparison with the first and second components for the positive- 
energy solutions in the nonrelativistic limit. This can also be shown 
without difficulty, when the particle is not free, from the structure of the 
general equation (43.22). Equation (43.19) shows that d' operating on 
the four-component wave function is the same as d operating on the large 
components alone. Thus the two large components of (43.27) with the 
H term give just the nonrelativistic equation with the Pauli spin matrices 
and the correct coefficient for the magnetic moment of an electron [see 
Eq. (39.10)]. 

We now show that in practical cases, the E term in (43.27) is of order 
(v/c) 2 times the e<f> term, and so is to be neglected in the nonrelativistic 
limit. We note first that the expectation value of a is of order 
(y/c)J^*^dr, since (43.13) shows that a mixes the large and small com¬ 
ponents (we also saw earlier that — col is the velocity operator). For an 
electron that is part of a system of linear dimensions a, e<j> is of order 
eEa, and h/a ™ p ~ mv. Thus the ratio of the E to the e<t> terms in 
(43.27) is of order 

ehv E 1 v 2 
me 2 eEa c % 

In contrast with this, we have already seen in Eqs. (39.9) and (39.10) that 
the H term in (43.27) is of the same order as the other magnetic terms 
that are linear in A. While the E term must be omitted from (43.27) 
for a consistent nonrelativistic approximation, it cannot be dropped from 
the relativistic equation (43.25), where it is required to preserve Lorentz 
invariance. 
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44. DIRAC’S EQUATION FOR A CENTRAL FIELD 

In the last section, the existence of the magnetic moment of an elec¬ 
tron was demonstrated explicitly by showing that the expected extra 
magnetic energy appears in the nonrelativistic approximation. The 
electron spin carries no energy in itself, and so can be observed only 
through its coupling with the orbital motion of the electron. In the 
first part of this section, we make this coupling apparent in two ways: 
through conservation of total angular momentum, and through the spin- 
orbit energy that was introduced in Sec. 38. In both cases we work with 
such potentials A,<£ that there is no transfer of angular momentum to the 
electron; this implies that we have a central field (A = 0 and <j> spheri¬ 
cally symmetric). In the latter part of this section, we separate the Dirac 
equation for a general central field, and find the energy levels of the 
hydrogen atom. 

Spin Angular Momentum. With A(r,£) = 0 and </>(r,2) = <f>(r), Eq. 
(43.22) can be written as 

ih ~dt = ^ (44.1) 

H = — ca ■ p — |9rac 2 + V 

where V = e<£. We might expect that the orbital angular momentum. 
L — r X p is a constant of the motion in such a central field. In order 
to investigate this point, we calculate the time rate of change of L with 
the help of Eqs. (23.2) and (23.16): 

ih <] -Ig = LJ-I - HL X 

= -COL ■ [(VPz - ZVv )P ~ P (»IPz - ZPu)] 

= ihci^CXz'Py OiyPz) (44.2) 

since L commutes with any spherically symmetric function such as V(r). 
It is apparent that L docs not commute with //, and hence is not a con¬ 
stant of the motion. However, we expect on physical grounds that it is 
possible to define a total angular momentum that is constant in a central 
field of force. This means that we must find another operator such that 
the commutator of its x component with H is the negative of the right- 
side of 44,2); the sum of this operator and L is then a constant of the 
motion and can be interpreted as the total angular momentum. 

It is not difficult to see that the desired operator is a multiple of the 
6' defined in (43.19). From (43.11) and (43.13), we find that <r x com¬ 
mutes with a x and although not with the other components of a: 
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Q&t - ctyr* 


/V® 0\/0 <r v \ _ /o Oy\ /(T® 0 \ 

\0 &%/ \CTy 0 / \CTy 0 / \0 <T®/ 

-(“ r)-Ut)-^ 


The time rate of change of <5' can now he obtained: 

ih^ = <r'JB - H<r' tt = —2ic(«,p„ - <%p.) 


(44.3) 


It is apparent from (44.2) and (44.3) that the quantity L + i-ftd' com¬ 
mutes with H and can therefore be taken to be the total angular momen¬ 
tum. We refer to the operator 

S = W (44.4) 


as the spin angular momentum of the electron. 

Approximate Reduction; Spin-orbit Energy. We now wish to show 
that the spin-orbit energy (38.13) is a consequence of the Dirac equation. 
This term can be shown to be of order (v/c) 2 times the potential energy: 


1 1 li7, T 1 1 V , 

V 2 mV r dr ^ V mV a 2 




t/2 

C 2 


where a represents the linear dimensions of the system, and 

h 

- V ~ mv. 

a 

Thus the approximations that led to (43.27) are not adequate for the 
present purpose. 

In order to obtain a consistent approximation that is expressed in 
terms of the more familiar two-component wave functions, we replace 
yf/ in (44.1) by \pi and which now represent respectively the first two 
and the last two components of We assume that ^ 1,^2 together con¬ 
stitute a nonrelativistic energy eigenfunction, which means that 

E = E r + me 2 

is regarded as a number rather than an operator; W and V are assumed to 
be small in comparison with me 2 . The wave equation then becomes 

{E f + 2mc 2 - V)t 1 + cd ■ p^ 2 = 0 r44 6 n 

(E f - 7)^ 2 + cd • p^i = 0 ^ ' 

where p is still an operator. It is apparent from the first of these equa¬ 
tions that ypi is of order v/c times ip 2} so we eliminate it to obtain an equa¬ 
tion in terms of ^2 alone. The substitution 

\f /1 = — ( E f + 2mc 2 — 7)' 1 cd • p \f/ 2 
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in the second of Eqs. (44.5) gives 

®m « -ip) (i + t^t)" < 4 •+ T *‘ (44 - 6) 


Thus far, no approximations have been made. 

The desired approximation is obtained by keeping the lowest terms in 
an expansion in powers of (E' - V)/2mc\ The following relations are 
easily established: 


U + 

V + 2 me 4 ) 


(d ■ grad 7)(d 


2 mc 2 

p7 = 7p - 

p) = (grad 7) 


1 — 


W — 7 


2mc 2 

grad 7 

p + id ■ [(grad 7) X p] 


With the help of these, (44.6) becomes 

- ^^5 (grad 7) • (grad ^ 2 ) + 4^535 <*' [(grad 7) X P^a] (44.7) 

Further simplifications can be made if 7 is spherically symmetric. 
We use the relations 

, J __ . dV d 

(grad 7) • grad = -r- T 


, T/ 1 d7 

e radF = f* r 


and note that E' — 7 is approximately equal to p 2 / 2 m, the accuracy 
being sufficient to replace the second-order term {E 7)p 2 in (44.7) 
by p 4 /2 m. We can then rewrite (44.7) as 


E'yp 2 


= [11- 
|_ 2 m 


P 4 

8 m 2 c 2 


+ 7 


ft 4 d7 d 


+ 


4 m 2 c a dr dr 2 m 2 c 2 


2m 3 c 2 r dr J 


*2 (44.8) 


where now S = and L = r X p* 

The first and third terms on the right side of (44.8) give the non- 
relativistic Schrodinger equation. The second term haa the form of the 
classical relativistic mass correction, which can be obtained by expanding 
the square root of (42.2): 

E' = E — me 2 = (cV + m 2 c 4 )l ^35 


The last term is the spin-orbit energy (38.14), which is now seen to 
appear as an automatic consequence of the Dirac equation. The fourth 
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The equations for f and g are 

- g + & - (- + 2 )/ = 0 

P \ ^ P/ 

We look for solutions of (44.19) in the form of power series 

/ = p 8 (oo + dip +■••)> ao 5^ 0 

0 - p 8 (b o + 6xp + • - • ), b 0 7& 0 

Since (44.14) is supposed to be finite at r = 0, we expect that s is greater 
than or equal to one. However, in analogy with the solution of the 
Schrodinger relativistic equation (42.15) for the Coulomb field, we shall 
admit a value of s slightly less than one, since the volume integral of 
will still be finite. 

We substitute (44.20) into (44.19), and equate the coefficients of 
p*+f-i to zero: 


(44.19) 


(44.20) 


for v > 0 . 


(s + v + k)b v — bp~i — ya, — — a v - 1 = 0 
($ -|- v k)a v — a v — x 4 * 7 b p -- b p —i = 0 

OL 

When v ■= 0, the equations analogous to (44.21) are 


(44.21) 


(s + k)b 0 — ya 0 = 0 
(8 — k)a 0 + ybo = 0 


(44.22) 


Equations (44.22) have the required nonvanishing solution for a 0 and 
&o only if the determinant of their coefficients vanishes; this gives 

s = ± (F - 7 2 )* (44.23) 


Because of the boundary condition at the origin, we take the upper sign 
for 8 in (44.23). 

A relation between a p and b p can be obtained by multiplying the first 
of Eqs. (44.21) by a, the second by ai, and subtracting. 

b ? [a(s + v + k) — oi X y] = a v [ai(s + v — k) + ay] (44.24) 

where use has been made of (44.16). We can now examine the behavior 
of the solution at large r. Unless both the series (44.20) terminate, this 
behavior is determined by their high terms, so we can neglect constant 
factors in comparison with v. We then obtain from ( 44 . 21 ) and (44.24) 


2 


2 
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This means that both series have the asymptotic form e 2 **, and regular 
solutions are obtained only if they terminate. Suppose that this occurs 
at v = n so that On'+i = 6 n '+i = 0. Then both Eqs. (44.21) give the 
relation 

ol\Ow = — ctbn', n’ = 0,1,2, . . . (44.25) 

We obtain energy levels by setting v = n' in (44.24), and making use 
of (44.26). With the help of (44.16), we find that 

2a (s + ft') = 7(«x — «a) = 

The square of this is 

(m 2 c 4 - E 2 )(s + n'Y = £‘V 
which is easily solved to give 

—[-rfwr <44 ’ 26) 

Equations (44.23) and (44.26) are equivalent to the formula first de¬ 
rived by Sommerfeld 1 on the basis of the old quantum theory. This 
formula accounts quite well for the spectrum of hydrogen. 2 The fine 
structure is made evident by expanding (44.26) in powers of 7 2 . The 
result to terms of order 7 4 resembles (42.21) but is not quite the same: 


where n =* n 9 + \k\ is the total quantum number of Eq. (16.14), and 
\k\ can take on positive integer values. The total spread in energy of 
the fine-structure levels for a given n is easily seen from (44.27) to be 

me 2 y A n — 1 
n z 2 n 


This is substantially less than the value (42.22) obtained from the 
Schrftdinger relativistic equation, and is in agreement with experiment. 

Classification of Energy Levels. For n f > 0, all positive and nega¬ 
tive integer values of k are permissible [we saw from (44.12) that k can¬ 
not be zero]. For n l = 0, however, a contradiction can arise between 
(44.22) and (44.25); these give 


ao s + k ao a 


(44.28) 


1 A. Sommerfeld, Ann. d. Physik , 61, 1 (1916). 

a Small deviations from the Sommerfeld-Dirac fine-structure formula have recently 
been established by W. E. Lamb, Jr., and It. C. Retherford, Phys. Rev., 72, 241 (1947). 
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respectively. Since s < |fr|, the first of these expressions is positive or 
negative according as £ is positive or negative, whereas the second is 
always negative. Thus for ml ~ 0, k can assume only negative integer 
values. 

Thus far we have only shown that the j value of a level is equal to 
\k\ — In order to connect l with the level, we must make the non- 
relativistic approximation that the orbital angular momentum is well 
defined. Since in this case 0 in (44.14) is much larger than F, we can 
replace 0 by — 1 and d' by 6 in (44.10). Now in this approximation, 
(L + ih6)* = [1(1 + 1) + iW + h6 . L, and is also equal to j(j + l)^ 2 . 
We obtain in this way 

*-><!+D-*•+!>-*- ii\t\ 

As an example of the energy levels in hydrogen, we consider the case 
n = 3. The radial quantum number ml can be 0, 1, or 2, and k can be 
±(3 — ml) except that k can be only —3 when ml = 0. The levels 
with their nonrelativistic classifications are 


n f 

k 

l 

j 


0 

-3 

2 

4 


1 

2 

2 

4 

*£>a 

1 

-2 

1 

4 

'Pi 

2 

1 

1 

4 

'Pi 

2 

-1 

0 

4 

'Si 


According to (44.23) and (44.26), states with the same |/r| or j have the 
same energy; Eq. (44.27) shows that the energy increases with increasing 

i*i- 

Negative Energy States. We have seen that both the Schrodinger 
and Dirac relativistic equations admit of solutions for which a particle 
has negative kinetic energy and negative rest mass. These solutions 
correspond to the negative square root of the classical energy equation 
(42.2). The negative-energy solutions cannot be ignored in the quantum 
theory, as they are in the classical theory, since there is nothing to prevent 
a charged particle from making a radiative transition from a state of 
positive energy to a state of negative energy. 

Dirac proposed that we regard the negative energy states of EJq. 
(43.22) as being full, in which case the exclusion principle prevents transi¬ 
tions into such occupied states. The normal state of the vacuum then 
consists of an infinite density of negative-energy electrons. It is assumed 
that there are no electromagnetic or gravitational effects of these elec¬ 
trons, but that deviations from the norm produced by emptying one or 
more of the negative energy states can be observed. The absence of a 
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negatively charged electron that haB negative mass and kinetic energy 
would then be expected to manifest itself as a positively charged particle 
that has an equal positive mass and kinetic energy. In this way, a 
“hole” theory of positrons can be formulated. 

With so many electrons present, however, the theory is no longer the 
one-particle theory contemplated when the wave equation was set up. 
A many-particle theory can be baaed on the Dirac equation in accordance 
with the formalism of quantized fields discussed in the next chapter, and a 
theory of positrons can be developed. 

It might at first be thought that a similar technique cannot be applied 
to the Schrodinger relativistic equation, since it describes a particle of 
zero spin, which we expect to obey Einstein-Bose statistics rather than 
the exclusion principle. However, Pauli and Weisskopf 1 have shown 
that the quantized field energy is always positive in this case, even though 
the parameter E in the wave equation can be either positive or negative. 
Moreover, the charge in the quantized field can have either sign, cor¬ 
responding to the ambiguity of the sign of P noted after Eq. (42.8). 
Thus both the theories discussed in this chapter predict the existence of 
particles that have positive energies and both signs of electric charge. 
The appearance of spin angular momentum as a consequence of the Dirac 
equation shows that this is the theory that describes electrons. 

Problems 

1. Show that the exportation values of E* and p 2 for a general wave-packet solu¬ 
tion of Eq. (42.4) satisfy the equation (7? 2 ) = c a (p ,J ) + mW. Discuss the connection 
between this result, and the classical equation (42.2). 

2. Use the nonrelativistic approximation implied in (42.11) and in the immedi¬ 
ately following discussion to show that the expression (42.8) for P reduces to (7.1) in 
the proper limit. 

3. Solve the Schrodinger relativistic equation for an attractive square well poten¬ 
tial of depth 7 0 and radius a. Obtain an explicit expression for the minimum Vq with 
given a that just binds a particle of mass m. 

4. Show explicitly that the wave functions (43.17) and (43.18) are not eigen¬ 
functions of any component of the spin angular momentum JAd'. 

6. Show that any matrix with two rows and columns can bo expressed as a linear 
combination of <r r , a in o-., and 1. Use this result to show that there is no matrix that 
anticommuton with each of the first three of these. 

6. Show that the current density given by (43.20) for a freo-particlo wavo func¬ 
tion agrees with the corresponding nonrelativistic expression in the proper limit. 

7. Make use of Eqs. (43.11), (43.13), and (43.10) to verify Eq. (43.24). 

8. Prove that the operators a r and k defined by Eqs. (44,9) and (44.10) commute 
with each other, and that Wk* is given by the right side of (44.12). 

9. Discuss the connection between the « • E term in Eq. (43.27) and the spin-orbit 
energy. 

1 W. Pauli and V. Weisskopf, Helv. Phy*. Acta , 7, 709 (1934). 
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10. Show that the negative square roots that could appear in arriving at Eqs. 
(42.16) and (44.26) actually do not correspond to bound states. 

11. Show explicitly that elimination of feo/flo between the two Eqs. (44.28) gives 
the correct energy levels for n' = 0 if k <0, but not if k > 0. 

12. Use the selection rules A2 » ±1, A j =» 0, ±1 to list the frequencies of the 
allowed transitions between the states with n » 2 and n ™ 3 for the Coulomb field, in 
both the Schrtidinger and Dirac relativistic theories. In particular, show that the 
latter theory gives seven lines, of which five are distinct, whereas the former gives 
three lines that are much more spread apart. 

13. Solve the Dirac equation for an attractive square well potential of depth V o 
and radius a. Obtain an explicit expression for the minimum V o with given a that just 
binds a particle of mass m, and compare with the answer to Prob. 3. 



CHAPTER XIII 

THE QUANTIZATION OF WAVE FIELDS 


The theory of quantum mechanics presented thus far in this book has 
dealt with systems that, in the classical limit, consist of material particles. 
We wish now to extend the theory so that it can be applied to the electro¬ 
magnetic field, and thus provide a consistent basis for the quantum theory 
of radiation. The quantization of a wave field imparts to it some particle 
properties; in the case of the electromagnetic field, a theory of light 
quanta (photons) results. The field quantization technique can also 
be applied to a ^ field, such as that described by the nonrelativistic 
Schrfidinger equation (6.16) or by one of the relativistic equations (42.4) 
or (43.3). As we shall see (Sec. 46), it then converts a one-particle 
theory into a many-particle theory; in the nonrelativistic case, this is 
equivalent to the transition from Eq. (6.16) to (16.1) or (32.1), Because 
of this equivalence, it might seem that the quantization of \f/ fields merely 
provides another formal approach to the many-particle problem. How¬ 
ever, the new formalism can also deal as well with processes that involve 
the creation or destruction of material particles (radioactive beta decay, 
meson-nucleon interaction). 

This chapter and the next are intended to serve as an introduction to 
quantum field theory. 1 We start in Sec. 46 with a discussion of the clas¬ 
sical and quantum equations of motion for a wave field, without specify¬ 
ing the detailed nature of the field. The application to Eq. (6.16) is 
used as a first example in Sec. 46, since the analysis is relatively simple 
and uncomplicated by considerations of relativity. The quantization 
of the Dirac equation (43.3) is carried through in Sec. 47. Several other 
particle wave equations (including the relativistic Schrodinger equation) 
have also been quantized; they are mainly of interest in connection with f 
meson theory, and are not discussed here. The electromagnetic field is 
considered in the next chapter. 

*For further discussion, see G. Wentzel, “EinfUhrung in dio Quantentheorie der 
Wellenfeldcr,” (Franz Dcutickc, Vienna, 1943; reprinted by Edwards Bros, Inc,, 
Ann Arbor, 1946; English translation published by Interscience, New York, 1048); 
W. Heisenberg, “The Physical Principles of the Quantum Theory,” Appendix, Secs. 
9-12 (University of Chicago Press, Chicago, 1930); P. A. M. Dirac, “The Principles of 
Quantum Mechanics,” 2d ed., Chaps. XI, XIII (Oxford, New York, 1036). 
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45. CLASSICAL AND QUANTUM FIELD EQUATIONS 

A general procedure for the quantization of the equations of motion of 
a classical system was obtained in Sec. 23. We start with the Lagrangian 
function for the system, and verify that it gives the correct classical 
equations. The momenta canonically conjugate to the coordinates of 
the system are found from the Lagrangian, and a Hamiltonian function is 
set up. The classical Hamiltonian equations of motion are then con¬ 
verted into quantum equations by the substitution of commutator 
brackets for Poisson brackets. We now show how this procedure can 
be applied in its entirety to a wave field ^(r ,t), which we assume for the 
moment to be real. 1 

Coordinates of the Field. A wave field is specified by its amplitudes 
at all points of space and the dependence of these amplitudes on the 
time, in much the same way as a system of particles is specified by the 
positional coordinates Qi and their dependence on the time. The field 
evidently has an infinite number of degrees of freedom, and is analogous 
to a system that consists of an infinite number of particles. It is natural, 
then, to use the amplitudes ^(r^) at all points r as coordinates in analogy 
with the particle coordinates qi(t) of Sec. 23. 

It is not necessary, however, to proceed in this way. As an alterna¬ 
tive, we can expand ^ in some complete orthonormal set of functions 

^(r f t) = Sa*(f)ttit(r) (45.1) 

The expansion coefficients a*, in (45.1) can be regarded as the field 
coordinates, and the field equations can be expressed in terms of either $ 
or the a*. We shall use the wave amplitudes at all points as the field 
coordinates in this section. It will be convenient for some of the later 
work to make use of the coefficients a k . 

Lagrangian Equation. The Lagrangian L(qi,qi,t) used in Sec. 23 is a 
function of the time and a functional of the possible paths q%(t) of the 
system. The actual paths are derived from the variational principle 
(23.3): 

8 f Ldt = 0, 5tfi(£i) = 8qi(t 2 ) = 0 
Jt i 

By analogy, we expect the field Lagrangian to be a functional of the 
field amplitude ^(r,£). It can usually be expressed as the integral over all 
space of a Lagrangian density L: 

L = JX(^, grad t)dr (45.2) 

1 W. Heisenberg and W. Pauli, Zeits.f. Physik , 56, 1 (1929); 59, 168 (1930). 
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where $ = d\p/dt. The appearance of grad ^ as an argument of L is a 
consequence of the continuous dependence of $ on r (continuously infinite 
number of degrees of freedom); higher derivatives of ^ could also be 
present, but do not seem to arise in problems of physical interest. The 
variational principle that corresponds to (23.3) is 

8 J* Lit = 5 f t 1 ' j Ldtdr = j* j ( 8L)didr = 0 (45.3) 

where the infinitesimal variation 8\p of \f/ is subject to the restrictions 

= «^(r,<t) = 0 (45.4) 

If L has the form indicated in (45.2), its variation can be written 

8L = 8i + V ——— s(—) + ^7 8$ (45.5) 

dp d(d\j//dx) \cte/ d\p 

xyz 

where the summation over x, y, z implies the sum of three terms with y and 
z substituted for x. Now 8p is the difference between the original and 
varied i/s and hence is the time derivative of the variation of This 
and the similar expression for 8(d\]//da^j can be written 

^ 3 (^) = 4 (s ^ 

Equation (45.3) then becomes 

£ / [ S '* ■+ X mm s w > + % 5 <<•»]'**'-' 0 (46 - 6) 

* 1/2 

The summation terms in (45.6) can be integrated by parts with respect 
to the space coordinates; the surface terms vanish, either because ^ falls 
off rapidly enough at infinite distance, or because $ obeys periodic 
boundary conditions at the walls of a large but finite box. The last term 
of (45.6) can be integrated by parts with respect to the time, and the 
boundary terms vanish because of (45.4). Equation (45.0) can therefore 
be written 

[•• f {“ - V i 1 - i (!£)] HdUb - 0 (45.7) 

Jh J Z/ dx Lt)(#/Ar)J dt\d^J] 

*1/2 

Since (45.3) is valid for an arbitrary variation 5^ at each point in space, 
Eq. (45.7) is equivalent to the differential equation 

dL d I" dL ~| _ d / jjA __ Q 

“ Z/ dx [d(d^/dx)\ Jt \dyp) 


(45.8) 
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Equation (45.8) is the classical field equation derived from the Lagrangian 
density L(\p, grad 4, 4,t)- 

Functional Derivative. In order to pursue further the analogy with 
particle mechanics, it is desirable to rewrite Eq. (46.8) in terms of L 
rather than L. Since the aggregate of values of 4 and 4 at all points is 
analogous to the q< and of particle theory, we require derivatives of L 
with respect to 4 and 4 at particular points. These are called functional 
derivatives, and are denoted by BL/S4 and SL/&4- Expressions for them 
can be obtained by dividing up all space into small cells and replacing 
volume integrals by summations over these cells. The average values of 
quantities such as 4, grad 4, and 4 in the ith cell are denoted by subscripts 
i, and the volume of that cell by Sr<. Then 

^ L[4i, (grad 4)i, iMfo 

« 

approaches L in the limit in which all the fir* approach zero. 

In similar fashion, the t integrand in Eq. (45.6) or (45.7) can be 
replaced by 


sirxiissi—jca*- 

i xyz % 

where the variation in L is now produced by independent variations in the 
pi and the pi. Suppose now that all the 8fa and 8fa are zero except for a 
particular 8fa. It is natural to relate the functional derivative of L with 
respect to p for a point in the jth cell to the ratio of 8L to 8fa; we therefore 
define 

dL j. 8L dL d |" dL 1 / a* q\ 

dp 3 ff->o 8ipj8rj dp Lj dx \_d(dp/dx) J ^ 

xyz 

Similarly, the functional derivative of L with respect to p is defined by 
setting all the 8fa and 8fa equal to zero except for a particular Sfa: 


dL 1 . 8L dL 
— = lim —— = — 
dp o 8fa8rj dp 


(45.10) 


Here again the point r at which the functional derivative is evaluated is in 
the jth cell. Substitution of (45.9) and (45.10) into (45.8) gives 


±$L_dL 
dt dp dp 


(45.11) 


which closely resembles the Lagrangian equations (23.4) for a system of 
particles. 
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Hamiltonian Equations. The momentum canonically conjugate to 
tpi can be defined as in particle mechanics to be the ratio of SL to the infini¬ 
tesimal change 8^, when all the other 5^ and all the are zero. We 
thus obtain 


Pi 


8L 

8\f/j 



(45.12) 


It follows from (45.11) and (46.12) that 

(4613) 


The analogy with Eq. (23.5) then gives for the Hamiltonian 


H = 




t 


— L 


(45.14) 


We write H as the volume integral of a Hamiltonian density H y and 
assume that the cells are small enough so that the difference between a 
volume integral and the corresponding cell summation can be ignored; we 
then have 

H = [ Hdr, H = - L, tt = ^ ^ (45.15) 

J H d\p 


The approximate Hamiltonian (45.14), with the relations (45.12) and 
(45.13), can be manipulated in precisely the same way as the Hamiltonian 
for a system of particles. Instead of showing this explicitly, we now work 
with the true field Hamiltonian H given in (45.15), which is a functional 
of ^ and 7r from which ^ has been eliminated. The classical Hamiltonian 
equations of motion will be derived without further recourse to the cell 
approximation. The variation of L produced by variations of ^ and 
can be written, with the help of (45.11) and (45.15), 


5L 


f ^ dr — J (** + Trb$)dr 

= J[5(7 p/0 + nty — $&ir]dr 
= 5 IJ + 5L + J (7t6^ — i/&rr)dr (45.16) 


The variation of II produced by the corresponding variations of and t 
can be written 



334 


QUANTUM MECHANICS [Chap. XIII 


It follows from the earlier discussion of functional derivatives that 


dll _ dH _ TA d dH 

dij/ d\[/ dx ^{difjdx) 

xyz 

dH _ dH _ Y d dH 

dir dir d(dir/Sx) 

xyt 

Comparison of Eqs. (45.16) and (45.17) for arbitrary variations 
Sir then gives the classical field equations in Hamiltonian form 


(45.18) 

St and 


, ffl . dH 


(45.19) 


The Hamiltonian equation for the time rate of change of a functional 
F of $ and 7 r can now be found. We express F as the volume integral of 
the corresponding functional density jP(^, 7 t), which for simplicity is 
assumed not to depend explicitly on the time or on the gradients of ^ or 
7 T. The foregoing analysis can be used to show that 



- [F,H] (45.20) 

This equation also serves to define the Poisson-bracket expression for two 
functionals of the field variables. The right side of Eq. (45.20) is not 
changed if F also depends on grad $ or grad 7 r (see Prob. 2). It is appar¬ 
ent from (45.20) that I? is a constant of the motion if it does not depend 
explicitly on the time; in this case, H is the total energy of the field. 

Quantum Equations for the Field. The analogy between particle 
coordinates and momenta and the cell averages faPi suggests that we 
choose as quantum conditions for the field 

[Mi ] = [Pi,Pj] = 0, [fo,Pj\ = ihSij (45.21) 

where the commutator brackets are defined by Eq. (23.9). This means 
that we have converted the wave field from a numerical function to an 
operator that can be represented by a matrix as in Sec. 23. 

We now assume that the cell volumes are very small. Then Eqs. 
(45.21) can be rewritten with the help of (45.12) and (45.15) in terms of 
^ and 7 r: 

['K r >i),'Kr',i)] = k(r,<),ir(r',i)] = 0 
[f(r,<),ir(r / ,0] = ihS(t,t') 
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where 6(r,r') = 1/dn if r and r' are in the same cell and zero otherwise. 
The function 5(r,r') has the property that J/(r)5(r,r')dr is equal to the 
average value of / for the cell in which r' is situated. Thus in the limit in 
which the cell volumes approach zero, 5(r,r') can be replaced by the three- 
dimensional Dirac 5 function 5(r ~ r'), defined in Eq. (11.14). The 
quantum conditions for the canonical field variables then become 


[iKr,Z)>(r7)] = Mr,0,ir(r',«)] = 0 
[iKr,/V(r7)l = - r') 


(45.22) 


The equation of motion for any quantum dynamical variable F is 
obtained as in Sec. 23 if the Poisson bracket in Eq. (45.20) is replaced by 
the commutator bracket divided by ih. 


ihF = [F,H] 


(45.23) 


The commutator bracket can be evaluated with the help of (45.22) when 
explicit expressions for F and H in terms of and 7r are given. Thus Eqs. 
(45.22) and (45.23) completely describe the behavior of the quantized 
field that is specified by the Hamiltonian IT. 

Fields with More than One Component. Thus far in this section we 
have dealt with fields that can be described by a single real amplitude. If 
the field has more than one component \l/ h i/% ... , the Lagrangian den¬ 
sity has the form L(\p h grad ^ 2 , grad ^ 2 , ^ 2 , - . - ,t). Then if 

each of the field components is varied independently, the variational 
equation (45.3) leads to an equation of the form (45.8) or (45.11) for each 
of \f/ 2 , . . . . A momentum canonically conjugate to each \f/ a can be 
defined as in Eq. (45.15) to be 7r„ — dL/di/^. The Hamiltonian density 
has the form 

(45.24) 


H = X ,r ^“ _ L 


and the Hamiltonian equations consist of a pair like (45.19) for each s. 
Equation (45.23) is unchanged, and the commutation relations (46.22) are 
replaced by 

f^(r,/),^(r',/)] = [7r fl (r^),7r^(r',/)] = 0 
[^(r,/.),7iv(rV)] = ih5 ai ,'5(r - r') 


(45.25) 


A case of immediate interest is a single complex field 1 ]/, for which we 
can write 

\f/ = 2~*0/'i + i\f/ 2 ) f # = 2“*( 1 /'i ifa) (45.26) 

where ^1 and are real. We show first that the equations of the form 
(45.8) obtained by independent variation of i/> and of # are equivalent to 
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those obtained by variation of and ^ 2 . It follows from (45.26) that 



Thus the sum and difference of the if i equation and i times the equa¬ 
tion give the Lagrangian equations that result from independent varia¬ 
tion of if and if, respectively, in (45.3). In similar fashion, the momenta 
canonically conjugate to ^ and # are seen to be 

ir = 2-*(tti — zV 2 ) and t = 2-*(7n + ix 2 ) (45.27) 

respectively. 1 Then irvi'i + 7^2 = 7nf + and the Hamiltonian is 
unchanged. The commutation relations for if, x, and f can be obtained 
from Eqs. (45.25) (with 5 = 1 and 2), together with (45.26) and (45.27), 
All pairs of variables except the following commute: 

[^(r,0,7r(r',0] = tt*(r - r') / 46 2g) 

IKtA,*FA] = «*(r - O ^ * 

46. QUANTIZATION OF THE NONRELATIVISTIC SCHRftDINGER EQUATION 
As a first example of the application of the field quantization technique 
developed in the preceding section, we consider here the quantization of 
the nonrelativistic Schrodinger equation (6.16). This application implies 
that we are treating Eq. (6.16) as though it were a classical equation that 
describes the motion of some kind of material fluid. As we shall see, the 
resulting quantized field theory is equivalent to a many-particle Schr6- 
dinger equation, like (16.1) or (32.1). For this reason, field quantization 
is often called second quantization; this term implies that the transition 
from classical particle mechanics to equation (6.16) constitutes the first 
quantization. 

Lagrangian and Hamiltonian Equations. The Lagrangian density 
may be taken to be 

L = ih\fs\j/ — grad if • grad \p — V(r,£)ifif (46.1) 

As shown at the end of the last section, if and if can be varied separately to 
obtain the Lagrangian equations of motion. The equation of the form 
(45.8) that results from variation of 1 f is 

- h 2 

which is the complex conjugate of Eq. (6.16). Variation of if gives Eq. 
(6.16): 

1 See footnote 1, page 337. 
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*2 

iH = -^VV+V(iM ( 46 . 2 ) 

The momentum canonically conjugate to ^ is 

ir = —y = ifiip (46.3) 

d\y 

However ^ does not appear in the Lagrangian density, so that ff is identi¬ 
cally zero. 1 It is therefore impossible to satisfy the second of the commu¬ 
tation relations (45.28) (or the corresponding classical Poisson-bracket 
relation), so that cannot be regarded as a pair of canonically conjugate 
variables. They can easily be eliminated from the Hamiltonian since 
never appears and Eq. (46.3) gives # in terms of 7 r. 2 
The Hamiltonian density is 

H = W' - L = - g— grad ?r • grad ^ ^ Vinf/ (46.4) 


The Hamiltonian equations of motion obtained from (45.19), with the 
help of (45.18), are 




= i v- 
h 




The first of these equations is the same as (46.2), and the second equation, 
together with (46.3), is the complex conjugate of (46.2). We have thus 
shown, from the point of view of classical field theory, that the Lagrangian 
density (46.1), and the canonical variables and Hamiltonian derived from 
it, are in agreement with the wave equation (6.16) or (46.2). 

Quantum Equations. The quantum equations are obtained by 
adopting the volume integral of (46.4) as the Hamiltonian, (45.23) as 
the equation of motion, and the first of Eqs. (45.28) as the quantum condi¬ 
tion on the wave field. Since $ is now an operator rather than a nu- 

1 The notation of Eqs. (45.27) does not necessarily imply that 7 f is the complex 
conjugate of 7 r, since these quantities were defined as canonical momenta. It is not 
difficult to see that t and t aro actually complex conjugates of each other if L is real; 
in this case the 71-1 and 7 r 2 that appear in (45.27) arc also real. 

a The conclusion that r can be identified with $ is related to the appearance of only 
the first-order time derivative in the wavo equation (46.2), since in this ease can be 
expressed in terms of ^ and its space derivatives through the wave equation. If the 
wave equation is of second order in tho time derivative, \p and ^ are independent; then 
7 r is related to i p rather than to #, and both ^, 7 r and \J / f x are pairs of canonical variables. 
The nonrelativistic Schrttdinger equation and the Dirac equation are of the former 
type, while the relativistic SchrSdinger equation is of the latter type. 
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merical function, # is to be interpreted as the Hennitian adjoint of \p rather 
than as its complex conjugate. We therefore follow the notation of Sec. 
21 and denote it by \j/*. It is convenient to make use of Eq. (46.3) to 
replace x by in which case the Hamiltonian becomes 

H = J grad ■ grad ^ + 7*v)* (46.6) 

Application of Eqs. (21.14) and (21.15) then shows that H is Hennitian. 
The quantized Hamiltonian given in (46.5) is the operator that represents 
the total energy of the field; it is not bo be confused with the operator 
given in (22.2), which is the energy operator for a single particle that is 
described by the wave equation (6.16) or (23.1). We have as yet giv en 
no explicit representations for the new operators \[/ and H , and therefore 
cannot say on what they operate. The choice of a particular representa¬ 
tion is not necessary so far as the equations of motion are concerned, but 
is desirable in the physical interpretation of the formalism that we give 
later in this section. 

The commutation relations are 


['K*W(?')] = W'*(r),**(r / )] = 0 

[+<?)#*<?)] = 5(r - r') 


(46.6) 


The omission of t from the argument of the field variables implies that 
both fields in a commutator bracket refer to the same time. 1 
Substitution of $ for F in Eq. (45.23) gives 


ih\j/ 


[f,H] 

f 5m grad ' **' ‘ e rad ' + |V, J yV*V'A-'J (46.7) 


where primes indicate that an integration variable r' has been substituted 
for r. The second term on the right side is easily evaluated with the 
help of (46.6) to give 


JF(W*y - ^*W)dr' = JF'(W*' - ^'Wdr' 

= j7V'5(r - r ')dr' = (46.8) 

commutes with V , which is a numerical function. Evaluation of the 
first term on the right side of (46.7) is simplified by performing a partial 
integration on /grad' $*' ■ grad' Vdr ' to obtain -J^VV'dr'; the surface 
terms vanish because either vanishes at infinity or obeys periodic bound- 


1 Commutation relations between quantities that refer to different times are of 
interest in connection with relativistic theories (see See. 47). 
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ary conditions. We thus obtain 

ty'jJ'grad' }//*' • grad' i p'dr'] = — bPj^'v'^dr 1 ] 

= -J(V'V)«(r - r')dr' = -vY (46.9) 

Substitution of (46.8) and (46.9) into (46.7) yields Eq. (46.2), so that the 
equations obtained from classical and quantum field theory agree. A 
similar calculation shows that the equation yields the 

Hermitian adjoint of Eq. (46.2); it can also be seen directly that this 
equation is the Hermitian adjoint of the equation ih}/ = Y,if] so long as 
H is Hermitian. 

If V is independent of t , H has no explicit dependence on the time, 
and Eq. (45.23) shows that II is a constant of the motion. Thus the 
energy in the field is constant. Another interesting operator is 


N = Jf*\pdr 

which we assume represents the number of particles in the field. We note 
first that N is Hermitian. Its time derivative is given by 


ihN = 


N>H] 


J i p*$dT, J grad' ^ • grad' + FY*ty'^ dr ' (46.10) 


The commutator of N with the V part of II can be written as 

JfV' Y*i/Y*Y' - i 

with the help of (46.6) the parenthesis in the integrand is 


}}/ — yf/*' fi(r — r')]^' — 

= - r') - 

= yf,*'[- 5(r - r')]^ + Wi(r - r') 


= 0 


since the 5 function vanishes unless r = r'. A similar but slightly more 
complicated calculation shows that 


Y’V, grad' ■ grad' \f/'] = Y* grad' \p' — (grad' * grad' 5(r — r') 

The double integral of this over r and r' is zero. Thus Eq. (46.10) shows 
that N is a constant of the motion. 

It can also be shown that the commutator brackets in (46.6) are con¬ 
stants of the motion, so that they are always true if they are true at a 
particular time. 
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The N Representation. We now specialize to a representation in 
which the operator N is diagonal. Since N is Hermitian, its eigenvalues 
are real. A convenient and general way of specifying this representation 
is by means of an expansion like (45,1) in terms of a complete orthonormal 
set of functions ^(r), which we assume for definiteness to be discrete. 
We put 



(46.11) 


where the u h are numerical functions of the space coordinates and the a* 
are operators that depend on the time. Equations (46.11) can be solved 
for the a*: 


a h {t) = Jti*(r)^(r,£)dr, at = Jw*(r)^*(r,0dr 


Thus if we multiply the last of the commutation relations (46.6) by 
tf*(r)wz(r') on both sides and integrate over r and r', we obtain 

[a>h(t) } a?(2)J = JM(r)wj(r')5(r - r r )drdT f = hi (46.12) 


because of the orthonormality of the u h . In similar fashion, it is apparent 
that an and ai commute, and that a * and af commute, for all k and L 
Substitution of (46.11) into the expression for N shows that 



(46.13) 


It is easily seen that each Nk commutes with all others, so that they can 
be diagonalized simultaneously. 

In order to find a representation in which N and all the Nh are diag¬ 
onal, we write each of the a h in the form 

a k = 2-ifo + ip k ), at = 2 ~i(q k - ip k ) (46.14) 

where q k and p k are Hermitian. This is always possible since Eqs. 
(46.14) can be solved to give 

q h = 2~*(a A + a£), p h = — i2~i(a k — at) 

and these operators are evidently Hermitian. It follows from (46.12) 
that 

[q^qi] = [Pk,pi] = 0, [qk,Vi] = ihi (46.15) 

and that 

Nk = iivl + ql) - £ (46.16) 

Equations (46.15) and (46.16) have the advantage over the equivalent 
earlier equations (46.12) and (46.13) that their solution has already been 
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obtained in connection with the theory of the linear harmonic oscillator. 
We now show that some of the results of Sec. 13 provide explicit matrices 
for the pk and q h that make the Nk diagonal. 

Connection with the Harmonic Oscillator. The quantum motion of a 
particle of mass m acted on by a force — Kx, where x is the displacement 
from equilibrium, was discussed in Sec. 13 from the point of view of the 
Schrtidinger equation. As shown in Sec. 22 and 23, the solution of this 
problem is equivalent to the diagonalization of the energy matrix 

«£+*** 2 

when the coordinate x and the canonically conjugate momentum p satisfy 
a commutation relation like (23.13): 

[x } p] — ih 

The energy eigenvalues were found to be given by Eq. (13.8): 

(n + i)h (|)*’ n = 0 , 1 , 2 , • ■ ■ 

The matrix for a; in the representation in which the energy is diagonal is 
given by Eqs. (13.18). 1 

If now we identify x with q k) p with p kj and set ft, m, and K equal to 
unity, we see at once that N k + i can be identified with the energy of the 
oscillator and has the eigenvalues n k + -J-, where Uk is a positive integer or 
zero. Equations (13.18) then show that the matrix for q k has the form 

(®0 n.,n t+ l = (<7fc)«»+l.n» = (46 ' 17 ) 

with other matrix elements vanishing. A calculation similar to that 
which resulted in (13.18) can be used to show that the matrix for p k has 
the form 

i = (46.18) 

with other matrix elements vanishing. The matrices for a k and a* can 
then be obtained from (46.14): 

Wn*,n*+1 = (a*)n*+l.n* = (™A, + 1)* (46.19) 

All other matrix elements vanish. Equations (46.17), (46.18), and (46.19) 
imply that the nonvanishing matrix elements refer to pairs of states for 

1 These results can also be obtained by purely matrix methods, without explicit 
solution of the Schrodingcr equation; see Dirac, op, cit. f Sec. 36. 
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which all the other have the same values, since q k , p kj a ky and a £ each 
commute with Ni for l ?£ k. 

The matrix for x in the harmonic-oscillator problem can be related to a 
set of wave functions u n (x) given by (13.13), such that 

Xnn' = fun(x)xu n '(x)dx. 

We also expect that the matrices for qu 7 p k , and can be related in a 
similar way to some kind of quantities that play the role of wave functions 
in the quantized field theory. We call these quantities wave functionals 
^ of the numbers n k ) they can be thought of as one-column matrices that 
have the properties 


. . . ,n k , . . . )*« . . . X, • • - ) 

^f*{nij . . . ffijfj . . . )ak^(nij . • . 7 n k7 . . ■ ) (46.20) 

5=5 (nje + l)^ nini <! • " ‘ 5 njb 4,i,njfc' 

**(n i, . . . ,n ki . . . . . . X, . . . ) 

— njJ 1 8 nini t * * • 5 njb _i,n*' 

in agreement with Eqs. (46.19). Equations (46.20) are equivalent to 
the statement that the 4>'s are orthonormal and satisfy the relations 

a h ^(n h . . . 7 n k} . . . ) 

= n**¥(m f . . . 7 n k - 1, . . . ) (46 21) 

a$*(n h . . . ,n k , . . . ) 

= (n k + lJ^Cni, . . . 7 n k + 1, . . . ) 

Physical Interpretation. Since we regard N as the operator for the 
total number of particles in the quantized field, it is reasonable to assume 
that N k is the operator for the number of particles in a state of the field that 
is described by the space function u k ( r). We thus have the result that 
a precise measurement of the number of particles in any state must 
be one of the positive integers or zero. Equation (46.13) then shows that 
this also applies to the total number of particles. 

Although N is a constant of the motion, N k need not be. Substitu¬ 
tion of N k for F in Eq. (45.23) gives 

ihN k = [a%a k ,H] 

H can be expressed in-terms of the a k by substituting (46.11) into (46.5). 


= J grac * ^ Ul + VUjUi )dr 

ji 


(46.22) 



Sec. 46] 


THE QUANTIZATION OF WAVE FIELDS 


343 


It is not difficult to show from (46.12) that a particular N * is constant if 
and only if all the volume integrals in (46.22) are zero for which either 
j or l is equal to k. These integrals are just the matrix elements of the 
one-particle Hamiltonian (22.2), so that the necessary and sufficient con¬ 
dition that Nk be a constant of the motion is that all such off-diagonal 
elements that involve the state u k be zero. 1 

The case in which the u k are eigenfunctions of (22.2) with eigenvalues 
Eic is of particular interest. The integrals in (46.22) are then Eibji, and 
the field Hamiltonian becomes 

II = £ a* k a h E k = ^ NtE k (46.23) 

k k 

This particular N representation is one in which H is also diagonal; the 
wave functional . . . ,n h . . . ) has the eigenvalue ^ n h E k for 

the total-energy operator II. It is apparent that all the N k are constant 
in this case. 

The first of Eqs. (46.21) permits the interpretation of a* as a destruction 
operator for particles in the state k } since it converts a wave functional into 
a multiple of one that has one less particle in that state. Similarly, 
a% can be interpreted as a creation operator , since it increases the number 
of particles in the Hh state by unity. 

Connection with the Many-particle Schrodinger Equation. The 
quantized field theory is closely related to the many-particle Schrodinger 
equation discussed in Sec. 32. If the u k are eigenfunctions of the one- 
particle Hamiltonian (22.2), the field theory shows that stationary solutions 
exist for which the number of particles n k in the A’th state is a con¬ 
stant positive integer or zero, and the energy is ^ n&B*. Each solution 

k 

can be specified by a wave functional ^(wi, . . . ■ . . )> the 

form a complete set, and there is just one solution tor each set ol numbers 
tti, . . . . A stationary many-particle wave function like the 1/ in 
Eq. (32.1) can be written as a product of one-particle wave functions 

u k (i)e if there is no interaction between particles. The linear com¬ 
bination of such products that is symmetric with respect to interchange ol 
any pair of particle coordinates can be specified uniquely by stating the 
number of particles in each state. Again, the number of particles in 
each state is a positive integer or zero, and the energy is the sum of all 
the particle energies. 

1 This result for the quantized field in closely related 1o the corresponding result., 
contained in IGq. (29.5), for the one-particle probability amplitude. 
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We see then that the quantized field theory developed thus far in 
this section is equivalent to the Schrodinger equation for several non¬ 
interacting particles, provided that only the symmetric solutions are 
retained in the latter case. We are thus led to a theory of particles that 
obey Einstein-Bose statistics. It can be shown that the two theories are 
completely equivalent even if interactions between particles are taken 
into account. 1 

It is natural to see if there is some way in which the quantized field 
formalism can be modified to yield a theory of particles that obey Fermi- 
Dirac statistics. As discussed in Sec. 32, a system of such particles can 
be described by a many-particle wave function that is antisymmetric with 
respect to interchange of any pair of particle coordinates. The required 
linear combination of products of one-particle wave functions can be 
specified uniquely by stating the number of particles in each state, pro¬ 
vided that each of these numbers is either 0 or 1. The desired modifica¬ 
tion of the theory must, therefore, limit the eigenvalues of each operator 
Nh to 0 and 1-. 

Anticommutation Relations. A review of the foregoing theory shows 
that the conclusion that the eigenvalues of each Nk are the positive 
integers and zero stems from the commutation relations (46.12) for the 
ah and a*. Equations (46.12) in turn arise from the commutation rela¬ 
tions (46.6) for * and **. Thus we must modify Eqs. (46.6) if we are to 
obtain a theory of particles that obey the exclusion principle. It is 
reasonable to require that this modification be made in such a way that 
the quantum equation of motion for \p is the wave equation (46.2) when 
the Hamiltonian has the form (46.6). 

It was found by Jordan and Wigner 2 that the desired modification con¬ 
sists in the replacement of the commutator brackets [A,B] a AB — BA 
in Eqs, (46.22) and (46.6) by anticommutator brackets [A } B] + = AB + BA. 
This means that Eqs. (46.6) are replaced by 

[*(r)>*(r')]+ = *(r)*(r') + *(r ')*0O = 0 
[**(r)i**0O]+ = **(0**00 + **00**00 = 0 (46.24) 

[*(0,**(O]+ = *(0**(O + **(O*(0 = 5(r - O 

It then follows directly from Eqs. (46.11) and (46.24) that 

[&k f <Li]+ = a k ai + aiak = 0 

[ctf ,a?] + = at af + afat = 0 (46.26) 

[a*,a?]+ = a k af + afa k = 8 kt 


1 See Heisenberg, op. cit., Appendix, Sec. 11. 

2 P- Jordan and E, Wigner, Zeits. f. Physik, 47, 031 (1928). 
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We define N k = o% a h as before, and notice first that each Nk com¬ 
mutes with all the others, so that they can be diagonalized simultaneously. 
The eigenvalues of Nk can be obtained from the matrix equation 

Nl = a%a k a%a k 

= a*( 1 “ a%a k )a k = aja* = N k (46.26) 

where use has been made of Eqs. (46.25). If N k is in diagonal form and 
has the eigenvalues n f h , n k , , it is apparent that Nl is also in diagonal 
form and has the eigenvalues njj?, a , .... Thus the matrix equation 
(46.26) is equivalent to the algebraic equations 

n'S = K, 

for the eigenvalues. These are quadratic equations that have two roots: 
0 and 1. Thus the eigenvalues of each Nk are 0 and 1, and the particles 

obey the exclusion principle. The eigenvalues oi N = ^N k are the 

k 

positive integers and zero, as before. The earlier expressions (46.22) and 
(46.23) for the Hamiltonian are unchanged, and the energy eigenvalues 

are ^ nrfSh. 

Equation of Motion. In order to find the quantum equation of motion 
for ^ when the Hamiltonian is given by (46.5), we must decide whether 
or not the general equation of motion (45.23) is to be retained. The latter 
equation was obtained by replacing the Poisson bracket by the com¬ 
mutator bracket in the classical equation (45.20). This replacement was 
justified by analogy with the particle theory of Sec. 23, by the identical 
algebraic properties of the two kindB of bracket expressions given in 
(23.12), and by the correspondence-theory argument of Prob, 10, Chap. 
VI. Thus abandonment of Eq. (45.23) means that the classical equation 
(45.20) is likewise abandoned; since many of the quantities with which we 
are concerned (number of particles, energy, etc.) have well-defined clas¬ 
sical analogues, we shall retain (45.23) as the general quantum equation 
of motion. 

The equation for $ is then given by (46.7), where now the anticom¬ 
mutation relations (4G.24) arc to be used in evaluating the right side. 
This causes (46.8) to be replaced by 

JT'(^*y - p)dr' = JEW*' + ^'^Wdr' 

= JTV'5(r - rOdr' = V$ 


The similar treatment of the first term on the right side of (46.7) gives 
no change in the right side of (46.9). Thus the wave equation (46.2) is 
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unaffected by the substitution of anticommutation for commutation rela¬ 
tions between the wave amplitudes. It can also be shown without dif¬ 
ficulty that N and the anticommutator brackets in (46.24) are constants 
of the motion. 

Physical Implications of Anticommutation. Since anticommutator 
brackets do not possess the algebraic properties of Poisson brackets, we 
can conclude that there is no classical analogue for the quantities ^ and 
a h that satisfy the relations (46.24) and (46.26). This does not mean, 
however, that N and H do not possess classical limits, for they are con¬ 
structed of bilinear combinations of ^ or a* and commute with each other. 

These conclusions can be confirmed on the basis of physical considera¬ 
tions. In order that a field amplitude be strong enough to be classically 
measurable, it must be possible to have a very large number of particles 
in the same state so that their fields are coherent. This implies that such 
particles obey Einstein-Bose statistics. We can, for example, conclude 
that light quanta or photons obey Einstein-Bose statistics, since it is 
known that strong electric and magnetic fields can be produced and are 
classically measurable. In the case of metallic electrons, which obey 
Fermi-Dirac statistics, quantities like energy, and charge and current 
density, are classically measurable since they can be expressed as bilinear 
combinations of the field amplitude, while the electron field amplitude 
itself is not. 1 

Representation of the Anticommuting a k Operators. An explicit 
representation for the operators that appear in (46.25) is easily obtained 
in the hypothetical but instructive case in which the system has only one 
state. The matrix equations to be solved are 

a 2 = a* 2 = 0, aa* + a*a = 1, N = a*a (46.27) 

Equations (46.27) are just those solved in Prob. 2, Chap. VI. We have 
already seen in (46.26) that N 2 = N , so that N has the eigenvalues 0 and 
1. If there is no degeneracy, N can be represented by the diagonal matrix 

N = ({[ J) (46.28) 

It is interesting to note that a cannot be diagonalized since N has a non¬ 
vanishing eigenvalue. If it could be, the first of Eqs. (46.27) would show 

1 The classical measurability of the field amplitude for any charged particle 
(Einstein-Bose or Eermi-Dirac statistics) implies that \p appears linearly in H } since 
the energy must then depend on i p itself as well as on bilinear combinations of This 
in turn implies that terms linear in a k or aj appear in the Hamiltonian, so that single 
charged particles can be destroyed or created. Thus ^ cannot be measured classically 
if the theory is such that electric charge is conserved. 



Sec. 46] 


THE QUANTIZATION OF WAVE FIELDS 


347 


that the square of each of its eigenvalues is zero; this would mean that a, 
and hence a* and N , are identically zero, so that N could not have the 
form (46.28) in any representation. 

Explicit matrices for a and a * that agree with (46.27) and (46.28) are 

°-(o J) “*-(? S) <4629) 

The two wave functionals that represent the two possible states of this 
system can be expressed as 

*(0) = (J), *(1) = (J) (46.30) 

It is easily seen that the first of (46.30) has the eigenvalue 0 for the opera¬ 
tor N given in (46.28), and the second wave functional has the eigenvalue 
1. The following relations are easily obtained from Eqs. (46.29) and 
(46.30): 

a$r(n) = n^( 1 — n), a*M>(rc) = (1 — n)^(l — n), n = 0,1 (46.31) 

Thus a and a* again play the roles of destruction and creation operators, 
respectively. 

In the actual problem, the number of states of the system is infinite, 
and it is not convenient to write down explicit matrices like those in 
(46.28), (46.29), and (46.30). We can, however, find the effects of opera¬ 
tion with dk and a* on a wave functional SE^Wi, . . . ,nk, . . . ) that has 
the eigenvalue Uk ( = 0 or 1) for the operator -A/V The desired relations 
would have the form (46.31), were it not that a scries of such equations 
(with subscripts added) would not agree with the first two of Eqs. (46.26). 

We therefore proceed in the following way. We order the states k of 
the system in an arbitrary but definite way: 1, 2, . . . , 7r, . . . . Then 
the effect of operating with (Mich a* or a* on SF has the form (46.31), 
except that a multiplying plus sign or minus sign is introduced, according 
as the frth state is preceded in the assumed order by an even or an odd 
number of occupied states. We thus replace Eqs. (46.21) by 

■ ) 

= O k n k A'(n h ... ,1 - n ki . . . ) 

= 0*(1. - tt*)*(n, f . . . ,1 - ru, . . . ) ( 46 - 32 ) 

k- 1 

i "i 

j-i 


a k y(n i, . . . ,n A ., . . 
- • • ,n k9 . . 


Ok = (- 1 )"*, 


Vk = 
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As an example, we calculate the effect of operating with a k ai and with 
aidk on a wave functional where we assume for definiteness that the 
order is such that l > Zr. If each operation is not to give a zero result, 
both n h and n z in the original & must equal unity. Operation with a k ai 
empties first the Zth and then the &th state, and introduces a factor 0z0*,. 
Operation with a z a k empties the Kh state first, so that 6 k is unchanged. 
But when the Zth state is emptied in this case, there is one less particle in 
the states below the Zth than there was in the previous case, since the Zrth 
state is now empty, whereas it was occupied before. Thus the sign of 
0 Z is changed. We find then that a k ai$r = — in agreement with 

the first of Eqs, (46.25). In similar fashion, it can be shown that Eqs. 
(46.32) agree with the result of operating with the other two of Eqs. 
(46,25) on any SF. Since the aggregate of wave functionals represents 
all possible states of the many-particle system, the Sf^s constitute a com¬ 
plete set, and Eqs. (46.25) follow as operator equations from Eqs. (46.32). 

47. QUANTIZATION OF THE DIRAC EQUATION 

As our second example, we now consider the quantization of Dirac’s 
relativistic equation (43.3) for a free electron. Our procedure again 
consists in treating the one-particle equation as though it were a classical 
field equation. The resulting quantized field theory represents the 
motion of a number of noninteracting free electrons. 

Lagrangian and Hamiltonian Equations. The Dirac wave function ^ 
has four components that we denote by (j = 1, 2, 3, 4). The Lagran¬ 
gian density may be taken to be 

L == ^ fa (ihfa — ihc ^ an ■ grad fa + me 2 ^ ft-z^z) (47.1) 

3 l l 

where the matrices an and ft- z are given in Eqs. (43.12). We must now use 
the multicomponent extension of the field theory described at the end of 
Sec. 45. Variation of one of the components \j/ z of \f/ leads to an equation 
of the type (45.8): 

me 2 ^ 'fri&n + ihc ^ grad ^ • aji — ih\pi = 0 (47.2) 

3 3 

The four equations like (47.2) may be written together as 

— + ihc grad \[/ ■ a + = 0 

where is the Hermitian adjoint of \j/ and is a matrix with one row and 
four columns; this equation is the Hermitian adjoint of (43.3). In similar 
fashion, it is easily shown that separate variation of the four fields 
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results in four equations that can be written together as 

ihyp — ihcoL • grad ^ + rac 2 ^ = 0 (47.3) 

which is the same as the Dirac equation (43.3). 

The momentum canonically conjugate to fa is 

= Jr = (47.4) 

Hi 

As with the nonrelativistic SchrSdinger equation, the momentum fr/ 
canonically conjugate to fa is identically zero; we therefore eliminate 
fa from the Hamiltonian with the help of (47.4). Then the Hamiltonian 
density is 

H - ^ ~ L 

j 

= c ^ TjOLji • grad fa + ^ TjPjtfa (47.5) 

jl il 

It is not difficult to show (see Prob. 12) that the first of the Hamiltonian 
equations (45.19) is the same as (47.3), and that the second Hamiltonian 
equation, together with (47.4), giyes the Hermitian adjoint of (47.3). 
This completes our demonstration that (47.1), (47.4), and (47.5) are in 
agreement with the Dirac equation. 

Quantum Equations. As with the nonrelativistic Schrodinger equa¬ 
tion, it is convenient to rewrite the Hamiltonian with the help of (47.4) 



= j(ihnl/*ct ■ grad ^ — mc^^pfadr (47.6) 

In spite of its appearance this expression is actually real, as can be seen 
by performing a partial integration on half of the first term to obtain 

H = S[h’hc(fa'a ■ grad — grad ^ • a^) — mety^pfadr (47.7) 

The surface terms vanish because ^ either vanishes at infinity or obeys 
periodic boundary conditions. 

We have already seen (Sec. 32) that electrons obey the exclusion 
principle. We therefore quantize the field by imposing anticommutation 
relations on the components of \p. With the help of (47,4) these become 

[&(r),Mr')] + = - 0 

[fc(r),*,*(r')] + = M(r - r') 


(47.8) 
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The substitution of \p* for $,■ implies that each fa is now a quantum- 
mechanical operator rather than a numerical function; we interpret \f/* to 
mean the matrix that has one row and four columns, whose elements are 
the operators ypf. 

The quantum equation for ^ is obtained by substituting fa for F in 
Eq. (45.23), where H is given by (47.6) or (47.7). 

iMi = [to, f (ihc Y jft'au • grad' to - me 2 Y tt'M'i) dr' 1 (47.9) 

Prunes indicate that an integration variable r' has been substituted for r. 
The second term on the right side can be evaluated with the help of (47.8) 
as follows: 

[to, / YtofWjir' 1 

H J 

= -me 2 Y f [to^rtoJ^r' 

id 

= -«c 2 T / (totoTto - to*'toto)dr' 

¥ 

= - mc ’ 2 y Au f (W + ttWtdr' 

hi 

~ —me 2 ^ fikdjk J to 5 (r — r')dr' — — me 2 ^ /3, 7 to 

Evaluation of the first term on the right side of (47.9) is accomplished in 
the same way, since to anticommutes with grad' to as well as with to: 

[to, / ihc y • grad' toA'] = t%e Y «, 7 • grad to 

X 

Thus the four equations like (47.9) are equivalent to the Dirac equation 
: ' s ^ m ^ ar calculation shows that the four equations of the form 

l "to = Lto*,#] yield the Hermitian adjoint of (47.3). 

The operator iV for the total number of electrons in the field can be 
written 

N = / = f y toWr (47.10) 

j 

It is apparent that 27 is Hermitian, and it can also be shown that 

ihN = [27, H] = 0 

so that 27 is a constant of the motion (see Prob. 13). As with the non- 
relativistic theory of Sec. 46, the anticommutator brackets in (47.8) can 
be shown to be constant in time. 
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The N Representation. A representation in which N is diagonal is 
conveniently found by expanding in plane wave solutions of the Dirac 
equation for a single free electron. The entire procedure is exactly 
analogous to that followed in the last section, but is complicated by the 
multicomponent nature of the Dirac field. We shall continue to denote 
the four components of ^ and of the plane waves by a subscript j or l . 
The propagation vector of a plane wave (momentum divided by A) will 
be denoted by k, and it will be assumed that these waves obey periodic 
boundary conditions at the walls of a large cubical box of edge length L. 

We saw in Sec. 43 that there are four solutions for each value of k; 
these will be distinguished by the letter a (= 1, 2, 3, 4), so that both k and 
s must be given in order to specify a one-electron plane wave solution. 
The free electron solutions that form a complete orthonormal set of func¬ 
tions within the volume 1? are then 


?;,-(k,s;r) = ti,-(k,s)L“S exp ik • r (47.11) 

The Uj(k,s) are numbers obtained by multiplying the four sets of Uj given 
in Eqs. (43.17) and (43.18) through by the normalizing factor given there. 
The two solutions (43.17) will be denoted by s = 1, 2, and correspond to 
the two spin orientations for the positive energy: 


E k „ = +(fiV 2 k 2 + m 2 c 4 )*, 

s = 1,2 

(47.12) 

The two solutions (43.18) are for negative energy: 



E k , = — (ft 2 c 2 k 2 + w¥)», 

8 = 3,4 

(47.13) 


The orthonormality relations for the Vj of Eq. (47.11) are easily seen to be 


j 7 ?“y(k,.s;r)^(k'/;r)r/r = 5 kk /5 iut > 


(47.14) 


As in Eqs. (40.11), we expand 1/7 and \p* in terms of the vf. 

fc(r»0 = y «(k,s; O'VOM; r) 


L-! 


tf(r,0 = 2 a*(k,«;f)B/(k,*;r) 

k» 


(47.15) 


The coefficients a and a * are quantum-mechanical operators that depend 
on. the time. With the help of the orthonormal ity property (47.14) it is 
not difficult to show that the anticommutation relations (47.8) are equiva¬ 
lent to 


[a(k,s;f),a(kV;0]-|. = [a*(k,s;f),a*(k',.s'; f)]+ = 0 
[a(k l *;<),a*(k / ,s';0]+ = 


(47.16) 
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Substitution of (47.15) into (47.10) shows that 

= £ iVk, where iVt. = o*(k,s;<)a(k,s;0 (47.17) 

fee 

The work of Sec. 46 shows that the eigenvalues of each N^ a are 0 and 1, 
so that there cannot be more than one electron in each state (which is 
specified by the spin orientation, the value of the momentum, and the 
sign of the energy). Explicit expressions for the a's, analogous to (46.32), 
can also be written down. 

Substitution of (47.16) into the expression (47.6) for the field Hamil¬ 
tonian gives 

H = £ / (ihctfafl • grad ft - meSfPMdr 

a*(k,$;0a(k',s';£)fy(k,s;r) 

•(ihcciji ■ grad — mc%i)ViQsftf' ;t)dr 

The solution of Eqs. (43.16) shows that the Vj defined by (47.11) satisfy 
the equations 

T (ihcccfl ■ grad - mc 2 p n )vi(k\s f ;r) = JS? kv ^(k',s';r) 

7 

We then obtain, with the help of (47.14), 




(47.18) 


where the E^ g are given by (47.12) and (47.13). It follows at once that 
the N ka are constants of the motion, since they commute with each other 
and hence with H . 

Negative Energy States and Positrons. All the results obtained thus 
far in this section can be obtained from the assumption that the ^s and 
a's commute, aB well as from the assumption that they anticommute. 
This makes it seem that the Dirac theory can describe either particles of 
spin that obey Einstein-Bose statistics, or electrons that obey the 
exclusion principle. It is easily seen, however, that the field energy 
operator (47.18) has negative eigenvalues of arbitrarily large magnitude, 
which correspond to electrons in negative energy states (s = 3,4). The 
existence of such eigenvalues implies that there is no equilibrium con¬ 
dition of the field when electromagnetic interactions are taken into 
account, since an electron will make radiative transitions to states of lower 
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and lower energy. This difficulty cannot be avoided so long as the 
particles are assumed to obey Einstein-Bose statistics. 

It was proposed by Dirac that the undesirable transitions to negative 
energy states be eliminated from the theory by the assumption that the 
normal condition of empty space is that in which all positive energy states 
are empty and all negative energy states are full: 

N ki - iV k2 = 0, = N ki - 1 for all k (47.19) 

This is an equilibrium condition of the field, since the exclusion principle 
prevents transitions into the negative energy states. It is also assumed 
that the infinite density of negative-energy electrons produces no observ¬ 
able electromagnetic or gravitational effects, but that departures from 
the vacuum values (47.19) are observable in the usual way. We there¬ 
fore subtract the vacuum values 



for the total electric charge and total energy, from the total charge and 
energy operators cN and H, respectively; e is the (negative) charge of an 
electron. 

The resulting expression for the total observable charge is 


•HIl 

k a-1,2 a-3,4 


ATi 


ka 


1 — N ka = a(k,s;i)a*(k,s;i) (47.20) 


The new operator AW has the eigenvalue 0 when the state ks is full and 
the eigenvalue 1 when that state is empty. The similar expression for the 
total observable energy is 


2(2 N*JEu+ 2 tfta'ltfkj) 

k a = 1,2 « — 3,4 


(47.21) 


According to (47.20), each positive-energy particle acts like a negative 
electron, and each missing negative-energy particle like a positive elec¬ 
tron; (47.21) then shows that the observable energy is positive, and equal 
to the sum of a positive term for each positive-energy particle and for 
each missing negative-energy particle. It is therefore reasonable to 
interpret the “holes” in the otherwise occupied set of negative energy 
states as positive electrons or positrons. Because of the relation (47.13) 
between energy and momentum, positrons have the same rest mass as 
electrons. The existence of positrons was predicted by Dirac on the basis 
of this theory in advance of their discovery in cosmic radiation. 1 

1 For further discussion of the formal aspects of positron theory, see Wentzel, 
op. tit., Chap. V. 



354 


QUANTUM MECHANICS 


[Chap. XIII 


We have seen that the Dirac theory, which describes particles of spin 
ih 3 can only be quantized in accordance with the exclusion principle. 
This is a special case of a general result, obtained by Pauli, 1 according to 
which particles of zero or integer spin obey Einstein-Bose statistics and 
particles of half-odd-integer spin obey Fermi-Dirac statistics. The con¬ 
nection between spin and statistics can be derived only for relativistic 
theories. For example, the negative energy states that prevent Einstein- 
Bose quantization of the Dirac equation appear only in a relativistic 
theory, and the nonrelativistic Schrddinger equation was successfully 
quantized in both ways in Sec. 46. 

Anticommutation Relations at Different Times. All the commutation 
and anticommutation relations that have been used up to now have been 
between quantities that refer to the same time. In a relativistic theory, 
however, there is some reason for being interested in relations of this kind 
between quantities that refer to different times. Such relations can be 
used to investigate the causal connections between events that occur at 
different times and places, and hence the relativistic character of the field 
theory as a whole. 

To see this, suppose that we have a physically observable quantity, 
such as particle or charge density, which can be represented by an opera¬ 
tor that depends on r and t. It is interesting to inquire under what cir¬ 
cumstances the values of this observable at different space-time points 
can be measured without interference of one measurement with the other. 
Measurements without interference can be made if the two operators that 
represent the observable at the two points commute with each other, 
since then their matrices can be diagonalized simultaneously and precise 
results (eigenvalues) can be obtained from both measurements. We 
expect that observables at the same time and different space points will 
always commute, since there is no way in which an effect can be propa¬ 
gated over a finite distance in zero time. In a nonrelativistic theory, 
observables at different time and space points need not commute, for 
there is no limit to the speed at which effects can be propagated. In a 
relativistic theory, however, we expect that observables at different 
space-time points always commute if the spatial separation of the two 
points is greater than c times the time interval. Thus commutation or 
anticommutation relations at different times can provide a direct physical 
check on a relativistic theory, but are of little interest in connection with a 
nonrelativistic theory. 

The anticommutation relations (47.8) or (47.16) are conveniently 
extended to different times by finding the equations of motion for the 
a's. We have from Eqs. (45.23), (47.16), and (47.18) that 

1 W. Pauli, Phys . Rev., 68, 716 (1940). 
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&'ftd(k,s; t) = [a(k,s; t),H] = jE7k 8 a(k > s; t) 
ihd*(k 7 s) 0 = [a*(k,s; t),H] = —j®h»a*(k,s; t) 

These are easily integrated to give 


a(k,s;/!) = a(k,s; 0 ) exp 


a*(k,s;/) = a*(k,s; 0 ) exp 


(47.22) 


Equations (47.16) apply when both the times that appear are the same; 
they can both be taken to be zero, in which case we can use (47.22) to 
obtain 

[a(k,$; 0 ,a(k',s'; t')] + = [a*(k,s; t)a*(k',s') *')]+ = 0 
[a(k,s; £),a*(k',s'; «')]+ = exp — - - -- (47,23) 

Substitution of (47.23) into (47.15) enables us to calculate the anti¬ 
commutation relations for ^’s at different times. It is apparent that 

Wv(r lO.lMrVOk = (*701+ = 0 (47.24) 

The anticommutator bracket for i p and becomes 

r , ,/ , )]+ = X [a(k 9 s; 0 ^a*(fc / ,« / ; r)*>i(k%«'; r') 

kk 7 ^ 

= l «i(k,«; r)S,(k,s; r') exp —- (47.25) 

kl 

The last sum could be simplified at once by making use of the closure 
property 

^ ?/j(k,ff)£i(k,a) = 5,7 (47.26) 

a 

which follows from (43.17) and (43.18), were it not for the fact that E^ s 
has different signs for different values of -s. 

As it is, we can st ill use (47,26) if we first rewrite the exponential in 
(47,25) so that it, does not involve 8 explicitly. Wc put 


/ ?74„A Ak 

(- — ) * “* -« 


. . Ak*T 

i sin - 7 — 
h 


___ I^kulr _ sin ( \E^ a \r/h) 
ft' ft.'“U4.I/A 

where t = t — |/ 4 a|/A = +c(fc 2 + /r*)* iw independent of $, and 

fco = mc/h. We can write this as 

/ iWk«A / d iAJkA win rr(fc 2 + fc$)* 

CXP V A / \«r A / rfifc* + A'?) 1 
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The remaining factor E^ a can be replaced by an operator: 

■EkstyOM; r ) = ^ (ihcujif ■ grad — mc*p 3 r)vi'(k,8; r) 

Substitution into (47.25) gives, with the help of (47.26), 
[^(r,i),i/'*(r / > t A )]+ = i 32 (k,s; r) 

k,g 

V <3 , __ j „ \ ,, sincr(A 2 + /<$)* 

• > ( + c« fl ' • grad + ick^vj v t >(k,s; r) — c ^ 2 _j_ ^. 2^4 

= (sf! ^ + c«,i • grad + ichopj^J 


I cttji * grad —I - ickofiji 


where 


i)l>(r - r\ 


- O (47.27) 


^ £ I/ -3 (exp * • r) (47.28) 


It can be shown that (47.27) reduces to the third of Eqs. (47.8) when 
t f = t (see Prob. 14). 

Commutation Relation for the Charge Density. The charge density 
in the Dirac theory, without the modification (47.20) that results in 
positrons, is 

p(r,i) = exp*(i,t)ip(r,t) = e ^ &*(r Mifat) 

3 

In order to discuss the extent to which measurements of p at different 
space-time points interfere with each other, we must calculate the com¬ 
mutator bracket 

[p(r,i),p(r',f')] = 

With the help of Eqs. (47.24) and (47.27), this can be reduced to 
[p( r »0>p( r SO] = e 2 ^ ^*(r,i)^(r',i') ^6,7 — + can ’ grad + ickufiji^ 

• D(r - r \t - t') - H.A. 
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where H.A. signifies the Hermitian adjoint of the preceding term. In 
general, p(r ,t) will commute with p(r',2') only when D(r — r r , t — t f ) is 
zero. 

We therefore examine the structure of the D function by replacing the 
k sum in (47.28) by an integral, as in Eq. (11.14) i 1 

■Dfao / ( ex P * • r ) *1 (47.29) 

The integration over the polar angles of k is readily carried out; the result 
can be written 

D(r,£) = (27r 2 rc) _1 J k(k z + /c 2 ) - "* sin kr sin ct(k 2 + kffldk 

= — ( 47 r 2 rc )” 1 ^ J (Jo 2 + &§)”* cos kr sin d(/b 2 + k$)*dk 
The substitution k = k 0 sinh x transforms this into 


5 / 00 

D(r,i) = — ( 47 TVC )" 1 / cos (/b 0 r sinh x) sin (kqct cosh x)dx (47.30) 
The integrand in (47.30) can be rewritten as 

£ sin (kod cosh x + k Q r sinh x) + ^ sin (koct cosh x — kor sinh x) (47.31) 

Further reduction depends on the relative magnitudes of ct and r. 
Suppose first that ct is positive and greater than r, which is always posi¬ 
tive; we can then put 

r 

koct cosh x ± k Q r sinh x = fc 0 (c 2 f 2 — r 2 )* cosh (x ± 0), tanh 0 s — 

ct 

The integral in (47.30) then becomes 

i j** sin [z cosh (x + 6)]dx + i sin [z cosh (x — 6)]dx f 

z s ko(c 2 t 2 — r 2 )* (47.32) 

These have the form of one of the integral representations of a Bessel 
function : 2 

1 f °° 

Jo(z) = - / sin (z cosh x)dx 


1 P. A. M. Dirac, Proc. Camb . Phil Soc., 30,150 (1934). 

2 E. T. Whittaker and G. N. Watson, “A Course of Modern Analysis,” 4th ed., 
p. 382 (Cambridge, London, 1936). 
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The two integrals in (47.32) are equal, and their sum is then equal to 
irJ 0 ( 2 ). It is apparent that for ct negative and less than — r, the integral in 
(47.30) is equal to — 

To consider the case in which ct lies between r and —r, suppose first 
that ct is positive and less than r. We then use a different expression for 
the arguments of the sines in (47.31): 

ct 

koct cosh x ± fc 0 r sinh x = ±& 0 (r 2 — cH 2 )* sinh (x ± 0'), tanh 0' = — 
The integral in (47.30) then becomes 

i J_ * sin [z l sinh (x + 8')]dx — $ J_ * sin [z' sinh (:x — 6')]dx, 

z' = /c ( j(r 2 — cH 2 )* 

Each of these integrals vanishes, since the integrands are odd functions 
of x + 0' or x — d'. We see then that the integral in (47.30) is equal to 

7r«/o[fto(c 2 2 2 — r 2 )*] for ct > r 

0 for r > ct > — r (47.33) 

—wJQ[kv(cH 2 — r 2 )*] for — r > ct 

We have thus shown that there is no interference between measure¬ 
ments of charge density at two different space-time points if and only if 
the spatial distance between the points is greater than c times the time 
interval. In this case there is no physical mechanism whereby a dis¬ 
turbance can be communicated from either point to the other. 

The hypersurfaces ct = ±r in four-dimensional space-time constitute 
the light cone , which is the locus of all light pulses that pass through the 
space point r = 0 at the time t = 0. An explicit expression for Z)(r,/) 
infinitesimally close to the light cone can be obtained by noting that 
(47.33) changes discontinuously from t to 0 as r increases in crossing 
the ct = r half of the light cone, and changes discontinuously from —ir to 
0 as r increases in crossing the ct = —r half of the light cone. Now the 
derivative of an increasing step function is a positive 5 function, so that 
we obtain from (47.30): 

D(r,0 ^ (47T7'c)- 1 [5(r - ct) - 5(r + ct)], ct S ±r (47.34) 

It is interesting to verify that the right side of (47.27), with the expres¬ 
sion (47.34) for D near the light cone, reduces to (r) when t — 0; this 
shows that (47.27) agrees with the third of Eqs. (47.8) in the proper limit, 
as of course it must (see also Prob. 14). To see this, we note that the 
terms (ca ; -j • grad + ick$ji)D(t,t) vanish when i = 0, since the two 6 
functions in (47.34) cancel. The term (d/d/)D(r,Z), however, becomes 
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— (27rr) _ “ 1 5 / (p) when t = 0. We now show that this quantity is equiva¬ 
lent to 5(r). Let /( r) be an arbitrary continuous function that has a 
continuous gradient at r = 0; then J/(r)8(r)dr = /(0). We also have 
that 

— J /(r)(27r?')"' 1 5 / (r)rfr ~ ~~ J 0 f(r)(27rr)- 1 5'(r)Airr 2 dr 
= - f n "° f(r)2rS'(r)dr 
= — J“"f(r)r8'(r)dr (47.35) 

Since the fourth of Eqs. (11.13) states that rS'(r) can be replaced by 

— 8(r), we see that (47.35) is equal to/(0). 

Problems 

. Make use of Eqs. (45.12), (45.13), and (45.14) to show that the classical 
Hamiltonian equations of motion for a field agree with the Lagrangian equations in the 
cell approximation. 

2. Show that ft = oven if the functional density F depends on grad p and 

grad 7 T as well as on p and 7 r. 

3. The wave amplitude ^(r n ,0 can be regarded as ft functional to which corre¬ 
sponds the functional density \p(T,t)5(T — r 0 ), and similarly for r. Show that Eq, 
(45.20) gives the correct aquations of motion for p and when uso is made of these 
functional densities. Also use them to calculate the Poisson-bracket expression for \p 
and 7 r at different points and the same time. 

4. Show in detail that the commutator of N and the kinetic-energy term 
grad • grad \p in (40.10) is zero, and that the commutator brackets in (46.6) are 
constant in time. 

6 , Make use of Eqs. (40.14) and (40.15), and the answer to Prob. 1, Chap. VI, 
to show that the matrix for <ik defined by (46.11) cannot be diagonalized. 

6 . Extend (40.11) and the following aquations to the situation in which the ith 
constitute a continuous set of functions. Then make use of Prob. 2 , Chap. Ill, to 
show that the measured number of particles in the infinitesimal neighborhood of any 
point is a positive integer or zero. 

7. Show that Eqs. (40.21) are completely equivalent to Eqs. (40.20) if then's 
are orthonormal. 

8. Show that the anlicommutator bracket has different algebraic properties from 
the commutator bracket, and the Poisson bracket. 

9. It is shown in the text that an electron field amplitude is not. classically meas¬ 
urable, since electrons obey the exclusion principle and have electric charge. Explain 
how it is that the diffraction pattern of electrons scattered by n crystal, which repre¬ 
sents interference of electron amplitudes, can be measured classically. 

10. Show that N is a constant of the motion for nonrolativistic Fermi-Dirac 
particles. 

11. Consider a system of Fermi-Dirac particles for which there are two states. 
Obtain explicit matrices for the a’s and 'p's that are analogous to (46.20) and (46.30). 

12. Show that the unquantized Hamiltonian equations for the Dirac theory agree 
with the corresponding Lagrangian equations. 

13. Show that N given by (47.10) commutes with the Dirac field Hamiltonian 
(47.6). 
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14. Show that the anticommutation relation (47.27) for ^ and at different 
space-time points reduces to the third of Eqs. (47.8) when the times are made the 
same. Make use of the replacement of the k sum in (47.28) by the k integral in 
(47.29), and a representation of the 5 function given in Sec. 11. 

15. Show that [p(r,£), p(r',t)] 5=3 0 for both quantizations of the nonrelativistic 
Schrddinger equation, and for the quantized Dirac theory. 

16. Show that if the wave functionals satisfy Schr6dinger-type equations ihi 

« H&j — ihif* = then the calculation of the time derivative of a general 

matrix element of F results in Eq. (45.23) [compare with the transition from (23.1) 
to (23.2)]. Show also that if the set of SE^s diagonalizes H as well as N, each V oscil¬ 
lates in time with a frequency determined by the corresponding eigenvalue of H. 



CHAPTER XIV 

QUANTUM ELECTRODYNAMICS 


The field quantization technique developed in the last chapter will 
now be applied to the electromagnetic field. In this case, the classical 
wave theory contains no suggestion of photons, while the quantized 
theory successfully explains the wave-photon duality discussed in Chap. 
I. It is convenient to consider first the electromagnetic field in empty 
space (Sec. 48), and then to take up the interaction between electrons and 
the field (Sec. 49). Some problems that illustrate the theory are solved 
in Sec. SO . 1 Since we are not concerned with establishing the Lorentz 
invariance of the theory, we use three-dimensional rather than four- 
dimensional notation throughout. 


48. ELECTROMAGNETIC FIELD IN VACUUM 
We shall follow the methods developed in Sec. 45. The equations of 
motion of the electromagnetic field are Maxwell's equations, and we 
start by finding a Lagrangian whose variation yields these equations. 
From this, canonical momenta can be defined, and a Hamiltonian can be 
set up. Quantization is effected by replacing the classical Poisson 
brackets by commutator brackets. We shall not consider the possible 
existence of anticommutation relations between the field variables, since 
experiment shows that strong electric and magnetic fields are classically 
measurable and that photons obey EinBtein-Bose statistics. 

Lagrangian Equations. Maxwell's equations in empty space are 
obtained by setting p and J equal to zero in Eqs. (35.2): 


i c , 1 dH A lU 1 dE n 

curl EH- 57 = 0 , curl H--rj « 0 

C dt ' C dt 

div E = 0 , div H = 0 


(48.1) 


The Lagrangian is most conveniently expressed in terms of the potentials 
A,</> that are partially defined by 


E = - - — grad <t>, H = curl A (48.2) 

C CJL 


1 For further discussion, see the references cited in footnote 1, page 329, and also 
E, Fermi, Rev. Mod. Phys 4, 87 (1932); L. Rosenfeld, Ann. Inst. Henri Poincar6, 1, 
26 (1931); W. Heitler, “The Quantum Theory of Radiation," 2d ed. (Oxford, New 
York, 1944). 
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As discussed in Sec. 35, this does not specify the potentials completely, 
since gauge transformations of the potentials can still be made without 
altering the electric and magnetic field strengths computed from (48,2). 
The Lagrangian density can be taken to be 

< 48 - 3 > 

The Lagrangian equations are obtained from (45.8) if we regard A x , A v , 
A g , and <f> as the field variables. Variation of the components of A gives 
three equations that can be written together as 

- icurtcuil A - s;|(~ + gred *) - 0 
This is the same as the second of Eqs. (48,1). Variation of 4> gives 

- s div (; i 7 + 8r * d *)- 0 

which is the same as the third of Eqs. (48.1). The definitions (48.2) 
for the potentials automatically satisfy the other two of Maxwell’s 
equations. 

Hamiltonian Equations. The momentum canonically conjugate to 
A x is found from (45.15) and (48.3) to be 

< 48 - 4 > 

with similar expressions for the other two momenta. The momentum 
canonically conjugate to <jf> vanishes identically, since <j> does not appear 
in the Lagrangian density. A similar situation was encountered with \f/ 
in the nonrelativiatic SchrSdinger equation (Sec. 46) and in the Dirac 
equation (Sec. 47); as before, it means that cannot be treated as a field 
variable and must be eliminated from the Hamiltonian . 1 
The Hamiltonian density is obtained from (45.24): 

dA 1 

H = P • — - L = 2ttc 2 P 2 + — (curl A ) 2 - cP • grad <j> (48.5) 

where use has been made of (48.4) to replace dA/dt by terms involving P. 
The Hamiltonian equations of motion (45.19) are 

^ = 4tc 2 P - c grad <j>, ^ = - -L curl curl A (48.0) 

1 For an alternative approach, see P. A. M. Dirac, V. Fock, and B, Podolsky, Ph i/h. 
Zeits . Svwjetunion , 2, 468 (1932). 
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The first of these equations is the same as (48.4); it is necessary that (48.4) 
be obtained over again in this way, since the Hamiltonian formalism con¬ 
sists only of (48.5) and the canonical variables A and P. We can now 
make use of this equation to define a quantity E that is equal to — 4?rcP. 
Then the second of Eqs. (48.6) agrees with the second of Maxwell's 
equations (48.1), if we also define a quantity H to be equal to curl A. The 
first and fourth of Eqs. (48.1) are satisfied because of the way in which 
E and H are defined. 

The third of Maxwell's equations cannot be obtained as a Hamilton¬ 
ian equation based on (48.5). We can, however, say that we shall be 
concerned only with those solutions of the Hamiltonian equations for 
which div E = 0, or div P = 0, at some definite time. If then we can 
show that this restriction is maintained at all times, the solutions so 
chosen form a consistent and satisfactory set. The time derivative of 
div P is found from the second of Eqs. (48.6) to be 

(1 1 

— div P = — — div curl curl A = 0 

at 4 TT 

Since the field equations are of first order in the time derivatives, we have 
shown that the restriction, that div E = 0 at one instant of time is equiva¬ 
lent to the validity of the third of Eqs. (48.1) at all times. 

We now see that the last term in the Hamiltonian density (48.5) con¬ 
tributes nothing to the field Hamiltonian. Its volume integral can be 
transformed by means of a partial integration into c/0 div Pdr, which is 
equal to zero; the surface term vanishes because P either vanishes suf¬ 
ficiently rapidly at infinity or obeys periodic boundary conditions at the 
walls of a large box. The Hamiltonian is then 

II = J ^2rc* P* + 

and <j) has disappeared. This is in agreement, with the usual expression 
(l/87r)J(E 2 + H‘ 2 V/r for the total energy in the electromagnetic field. 

Quantum Equations- The classical electromagnetic field is converted 
into a quantum field in the following way. We start with the Hamil¬ 
tonian (48.7) and the canonical field variables A,P. Since 0 no longer 
appears, it is convenient, to choose the gauge so that 0 = 0. The general 
equation of motion is (45.23), and the commutation relations (45.25) 
between the field variables become 

W.(r,0, AA r',0] = [J\<r f 0, P.'(r\/)] - 0 
[i4 fl (r,/,), PV(rV)] = »A5^fi(r - r') 

Each of the indices s,s ' can be x 3 y , or z. 



(48.8) 
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The equation of motion for a typical component of A is 
ihA x (i,t) = [AafoO, H] 

A x commutes with the (curl A) 2 term in H , and also with that part of the 
P 2 term that involves P\ + PJ; thus we need calculate only the commuta¬ 
tor of A x and the P 2 term in H . This is 2?rc 2 times the integral over r' of 
[AajP'J 1 ], where the prime indicates that the argument is r' rather than r. 

[A x (r,t), P’(rV)] = AJP? - P'M m 

= [P'A* + m r - r')]P£ - P'M, 

= P' a [P'*A* + ^5(r - r')] + ihd(i - i')P' m - PM * 

= 2iA5(r — r')P a (i',t) 

Integration over r' gives the relation 

ihA a (r,t ) = 2s-c i j2ih8(t — T f )P a (i r j)dr f 

= 4TcW»(r,i) (48.9) 

This is the same as the corresponding classical equation, which is the first 
of Eqs. (48.6), when <£ = 0. 

The equation of motion for a typical component of P is 
ifiP x (x,t) = [P x (ht),H] 

P a commutes with all the integrand of H except for that part which con¬ 
tains the sum of the squares of the y and z components of curl A. The 
calculation of this commutator bracket is straightforward but tedious 
(see Prob. 1), and yields an expression for P that is in agreement with the 
second of Eqs. (48.6). Thus if we define E = —47 tcP and H = curl A, 
the quantum equations of motion for A and P agree with the first, second, 
and fourth of Maxwell’s equations (48.1). 

The third of Maxwell’s equations must be imposed as a supplementary 
condition } as in the classical case. If we set div P equal to zero at a 
particular time, it is always zero since its time derivative is zero. Equa¬ 
tion (48.9) then shows that the time derivative of div A is always zero, or 
that div A is a constant in time. It is convenient to restrict the choice of 
gauge further so that div A is zero everywhere at a particular time, in 
which case we see that it is zero at all space-time points. It is apparent, 
however, that the introduction of the supplementary condition is incon¬ 
sistent with the commutation relations (48.8). For example, the com¬ 
mutator bracket of A 9 and div P should be zero, since div P is zero, but 
is computed from (48.8) to be 

[A*(r,i), div' P(r',0] = ih 5(r - r') 
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It is not surprising that this inconsistency should arise, since Eqs. (48.8) 
imply that there are three independent pairs of canonical variables, 
whereas the restrictions div P = 0 and div A = 0 cause only two of these 
pairs to be linearly independent. We should therefore modify the com¬ 
mutation relations so that they are consistent with the supplementary 
condition. 

The nature of this modification is established in Prob. 2. It turns out 
that the commutator brackets of A(r,i) and P(r',i) do not vanish when 
r — r' is finite. This would appear at first to contradict the physical 
principle that there can be no interference between measurements per¬ 
formed at different places and the same time (see the discussion of Sec. 
47). However, the vector potential A is not in itself a physical quantity; 
only the electric and magnetic fields are directly measurable. We shall 
now show with the help of (48.8) that the commutation relations of E and 
H have the required infinitesimal character and are, moreover, consistent 
with the supplementary condition div E = 0. It can also be shown (see 
Prob. 3) that the same results are obtained by starting with the modified 
canonical commutation relations of Prob. 2. 

Commutation Relations for E and H. The electric and magnetic fields 
are defined by the equations 

E = —4ttcP, H = curl A (48.10) 

where the commutation relations for A and P are assumed to have the 
form (48.8). We see at once that 

[E*(r,0, E 8 '(r',0] = H(r,/,), H,'(r',0] = 0 (48.11) 

where each of the indices s,s' can be x } y, or z . The commutator bracket 
for typical parallel components of E and H is 

tE.Cr,0. H*(r',0] = -4xc [. P x , - ^)] = 0 (48.12) 

For typical perpendicular components of E and H* we obtain 

[E.(r,0, H»(r',f)] = -ire [p„ - ^)] 

= -4rc ^ [P X ,K\ 

= iircih 8(r - r') (48.13) 

oz 

Other relations similar to (48.13) arc obtained by cyclic permutation of 
x,y,z. 
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It follows at once from (48.11) that div E commutes with all com¬ 
ponents of E. The commutator bracket of div E and a typical component 
of H is, with the help of (48.13), 

[div E, H3 - [f, Hi] + [%, h;] 

-&~ r) + kk sv ~ r) ] (48 - 14) 

Since (d/dy')S(r' — r) = — ( d/dy)8(r ' — r), we see that the right side of 
(48.14) is zero. Thus div E commutes with E and H, and hence also with 
the Hamiltonian, which from (48.7) can be written as 



+ H 2 )dr 


(48.16) 


This means that div E is a constant of the motion, and so is zero at all 
space-time points if it is made to vanish everywhere at a particular time. 

As would be expected, the field commutation relations (48.11), (48.12), 
and (48.13), together with the Hamiltonian (48.15), can be used in place 
of the canonical formalism originally developed in terms of A and P. We 
have already seen that div E is a constant of the motion; a similar calcula¬ 
tion shows that div H is constant, so that it can also be made to vanish at 
all space-time points. The first two of Maxwell's equations (48.1) 
then follow as special cases of the general equation of motion (45.23) 
(see Prob. 5): 


ihL = [E., H] = i- j [E I( (H; 2 + H7)W 
= ihc{z\xA H) a 

A = [H* H] = A- J [H„ (Ep a + E7)]dr' 

= —ihc (curl E)* 


(48.16) 


Plane Wave Representation. For many applications, a representation 
of the potentials and fields in a complete orthonormal set of plane waves 
is useful. These plane waves are taken to be vector functions of r that 
are polarized perpendicular to the propagation vector so that the condi¬ 
tions div A = div P = 0 are satisfied. 


Ukx(r) = L~ ®£kx exp ik • r, X = 1,2 

The vectors k are chosen as in (11.3), so that the Ukx satisfy periodic 
boundary conditions at the walls of a large cubical box of volume L 3 . The 
Ek\ are unit vectors, and ct i, ek 2 , and k form a right-handed set, so that 
h ‘ e k\ = 0 and div Ukx = 0. It is easily verified that the orthonormality 
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property assumes the form 

J Ukx • Ut'x'dr = Skk'Sxx' 

We expand A and P in terms of the Ukx: 

A(r,f) = J' [gkx(0ukx(r) + ffkx*(Ohkx(r)] 

(48.17) 

[pkx(i)Ukx(r) + Pk \*(0ftkx(r)] 

The operators <?kx* and pkx* are the Hermitian adjoints of gtx and pkx, 
respectively; thus A and P are Hermitian. The primes indicate that the 
summations extend over half the k space, so that the plane waves flkx do 
not duplicate u_kx- 

We take for the commutation relations between the q ’s and p’s 

[<Zkx(0> Pk'v*(0] = [<Zkx*(0> Pk'x'M] = ihd kk'Svx (48.18) 

with all other pairs commuting, and verify that they give the correct com¬ 
mutation relations between A and P. It is apparent that 

M-(r,0, Mr',*)] = [P.faOi PAx'J)] = 0 

We also obtain from (48.17) and (48.18) 

[A»(r,i), — V V {[(/kx(0; Pk r x*(0]' w 'kx,fl(r)'ak ,r x , ja'(r / ) 

kx k'X' 

+ [</kx*(0> Pk'x^Ol^kx.aW^v.^tr')} 

= ihl~ z y € k x,a€kx. fl '[cxp ik ■ (r - r')] (48.19) 

kX 

The subscripts s,,s' denote cartesian components of the vectors on which 
they appear; the prime has been removed from the last summation in 
(48.19) since the primed summation over terms with both k and —k is * 
equivalent to a summation of k terms over the entire k space. 

If there were throe mutually perpendicular unit vectors £kx, then the 
three numbers ekx,* would be the direction cosines of the cartesian direc¬ 
tion s, and we would have that ^ 6k\ lfl ekxy = 8 aa '. Since there are just 

x 

two unit vectors ckx that are perpendicular to each other and to k, we can 
write 

«kx,«ekxy = d**’ — -jit 
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We also have that 

k,k,'[exp * • (r - r')] = [exp ik • (r - r')] 

With these substitutions, and the replacement of L“ 8 Y by (2r)~ 8 fdn 

K 

when L is large, we can rewrite (48.19) as 
[4 tf (r,£), P 8 '(r',0] = zhSaa'i (27r) _s /[exp ik • (r - T')]dr k } 

~ ih W. W,' { (3,r) " 8 / ¥ [exp * ‘ (r ~ r ')]^j (48.20) 

The first brace expression in (48.20) is equal to 8 (r — r'). The second 
brace is the Green's function Go(r,r') given in (26.12), which according to 
(26.15) is equal to (47r|r — r'l)” 1 . The commutator bracket (48.20) 
then becomes 

[AW), PA i',1)1 = M.Mx (,^7,) (48.21) 

which is that assumed in Prob. 2; the other commutator brackets vanish. 
This confirms the choice of the commutation relations (48.18). 

Quantized Field Energy. .Substitution of (48.17) into the field 
Hamiltonian (48.7) gives 

H = ^ ^4?rc 2 pkxPkx* + ^ <?kx?kx*^ (48.22) 

kX 

since #kx and gtx*, and ptx and pkx* commute; here use lias been made of 
the restriction on the summation to half the k space, which makes all 
integrals of the form Jutx * Uk'ydr vanish. 

We wish now to find the eigenvalues of H when the commutation rela¬ 
tions are as given in (48.18). This can be done by choosing linear com¬ 
binations of the plane wave amplitudes that make II formally equivalent 
to the sum of the energies of a number of harmonic oscillators (see Sec. 
46). Now each index pair k,\ corresponds to two linearly polarized plane 
waves that travel in opposite senses along the k direction. Thus wo 
want our new linear combinations of QkxUkx and p k xu k x to have the general 
forms 

a kx exp t(k ■ r - kct), a' kx * exp i(k • r + led) (48.23) 

where a kx and a'kx are operators that are independent of r and t. The first 
of these is a plane wave that travels in the positive k direction and has 
the positive angular frequency Icc, and the second is the Ilermitian adjoint 
of a plane wave that travels in the negative k direction and has the same 
frequency. 
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With the remarks of the last paragraph as a guide, we proceed to find 
the time dependence of qk\ and pkx. Their equations of motion are given 
by (46.23): 

= [qk\,H] = 4iriAc a pkx 

... r m ihk* (48.24) 

= VPvk,H] = - Qkx v 1 

Elimination of pkx gives a second-order equation for gkx 

tfkx = 4irc' 2 pkx = -/oVgfcx 

This is easily integrated to give 

3kx(«) = a k x<r tt- + a' k xV*“ (48.25) 

where we have followed the pattern indicated by (48.23). We then 
obtain from the first of Eqs. (48.24) 


(48.26) 


Equations (48.25) and (48.26) can be solved for the a 1 s. 

! / , 47 ric \ 

«kX = 2’ l H-?^kX 1 

«' k x* = i (qk\ - Pkx^ 

Similar relations hold for their Hermitian adjoints. The commutation 
relations for the a’s can be obtained from (48.27) and (48.18): 


nihot 


—ikot 


(48.27) 


[ttkX,«*k'X'] = [u'kX^Vx'] = 


(48.28) 


with all other pairs commuting; these are independent of the time, as 
they Bhould be. 

Substitution of (48.25) and (48.26) into the Hamiltonian (48.22) 
gives 

II = V ~ (etkxfflkx* + a'kx*a'kx) (48.29) 

kX 

If we adopt the definitions 

A^tx = 2^«kx*a k x, A^kx = 27rhc (48.30) 

we see from the work of Sec. 46 that N k x and Af'kx each have the eigen¬ 
values 0,1,2, .... In terms of the N’n, the Hamiltonian (48.29) 



370 


QUANTUM MECHANICS 


[Chap. XIV 


becomes 

H = Y hck(Nto + N'n + 1) (48.31) 

tx 

Because of the structure of (48.23) and (48.25), we can identify a' k x with 
o-kx, and N'kx with N- tx. Then the restriction on the summation of 
(48.31) to half the k space can be removed to give 

FI = Y hc,Jc(N kl + i) (48.32) 

kX 


Equation (48.32) is equivalent to Planck's quantum hypothesis: the 
energy associated with each plane electromagnetic wave is an integer 
multiple of the fundamental quantum hv = hkc . In addition to the 
Planck energy, however, there is the harmonic-oscillator zero-point 
energy of one-half quantum per state of the field, which is infinite since 
there are an infinite number of states. This infinite energy is not objec¬ 
tionable since it does not interact with charged matter. 1 

Quantized Field Momentum* The momentum density of an elec¬ 
tromagnetic field is the Poynting vector (c/47r)E(r,£) X H(r^) divided by 
c 2 . The total momentum in the field can then be written in terms of the 
canonical variables, with the help of (48.10), 


G = i~cf ExHdr= - /PX (curl A )dr 
Substitution from (48.17), (48.25), (48.26), and (48.30) gives 

G = i } k(Pkx?kx* - Pkx*?kx) 

kX 

= V k/c[(a k xct k x* + a k x*a k x) - (aWkx* + a'kx V kx )] 

kX 

= ^ ftk[(XT k x + i) - (W' k x + *)] = V KkNu 

kX kX 


(48.33) 


where the restriction on the summation is removed as in the transition 
from (48.31) to (48.32). In this case, the zero-point terms cancel for 
plane waves that travel in opposite directions. 

Equations (48.32) and (48.33) show that the energy and momentum 
of each plane wave are quantized in units of hkc for the energy and hk for 
the momentum. It will also be shown explicitly in Sec. 50 that the intcr- 


1 See also the discussion following Eq. (50.17). 
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action, between matter and radiation is such as to account both for the 
interference of light and for the discrete properties of light quanta. 

A(r 9 t) in the Plane Wave Representation. The vector potential 
appears in the interaction between electrons and the electromagnetic 
field discussed in the next two sections. We shall therefore require an 
expression for A(r,£) in the plane wave representation that is specified by 
the eigenvalues nkx of the operators iVkx. A typical wave functional for 
this representation can be written as ^(- • • ntx • • ■), which describes a 
state of the electromagnetic field in which there are ntx light quanta with 
momentum Kk and polarization ckx» We then see from Eq. (48,28) and 
the work of Sec. 46 that the operators and akx* have the properties 


*»*(• *(•••«»-1 • • ■) 


(48.34) 

• •) 


We obtain from (48.17) and (48.25) 


A(r,<) = Ir i ^ skx[(«kx« _ '* <!i + a'kxV* cl )(exp ik ■ r) 

kX 

+ (akxV ftfii + a'kxc - ^ 0 ') exp(— £k * r)] 

= It * ^ £kx{%x[oxp i(k ■ r - kct) ] 

kx 

+ a-k\* exp[ —i(k - r — lcct)]} (48.35) 


Here again the restriction has boon removed from the summation by 
identifying a f kx with a~, kx. This expression for the vector potential is 
easily seen to be llermitian, as it must be. It follows from the structure 
of (48.34) that a k x and ttkx*are destruction and creation operators, respec¬ 
tively, for a light quantum in the state k,X. Thus a term in the Hamil¬ 
tonian lineal’ in A would give rise to the emission and absorption of light 
quanta. 

Commutation Relations at Different Times. It is interesting to 
generalize the commutation relations (48.11), (48.12), and (48.13) for 
the components of E and H, to the ease in which the times are different. 1 
As with the quantized Dirac equation (See. 47), the result shows under 
what circumstances measurements of the electromagnetic fields at dif¬ 
ferent space-time points affect each other. 

Expansions for E and H in terms of the Gkx, that are analogous to 
(48.35), can be found without difficulty: 

1 These commutation relations arc*, due to P. Jordan ancl W. Pauli, Zeita.f. Physik , 
47, 151 (1928). 
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E(r,2) = ^f/fc£kx{akx[exp i(k ■ r — kct)] 

— <ztx* exp[— i(k • r — te£)]} (48.36) 

H(r,i) = Ir® y i(k X skx){okx[exp i(k ■ r — kct)] 

kx 

— dk\* exp[—i(k ■ r — te£)]} 

The commutator bracket for two cartesian components of the electric 
field strength is 

[Ea(r^), Ea'CrSO] = -k ” 8 Y ^nrihcke^e kxy sin (k • 9 — kor) (48.37) 
tx 

9 == r - r', T = t - t' 

where use has been made of (48.28). The summation over the polariza¬ 
tion index X can be evaluated by the technique used on the similar sum¬ 
mation in (48.19): 



€kx lfl €kxy sin (k • 9 — ken) = ^ — k B k a ) sin (k ■ 9 — ter) 


-f[?s (48 - 38) 

The summation over k can be evaluated by replacing it with an integral 
for large L : 


Zr 8 ^ kr 1 sin (k • 9 — ter) * (2r)~ z j kr l sin (k • 9 — 7ccr)dr ft 

k 

= (27t) - 8 J (2ifc)" 1 [exp (tk • 9 — iker) 

— exp (— ik * 9 + ikcr)]drh 
= (27t)~ s J (2^) -1 [exp (zk • 9 — iker ) 

— exp (ik * 9 + ikcr)]drk 
= — (27r)“ 3 Jfc _1 (exp (zk ■ 9 ) sin her dr k 


We denote the last expression by —cD 0 ( 9 ,r), where D 0 is the same as the 
D function of Eq. (47.29) except that k 0 is set equal to zero. 

We can see from (47.30) and (47.33) that £> 0 ( 9 ,r) is obtained as the 
result of operating with — (47r 2 pc)” 1 (^/^p) on a function that is equal to 
+ 7 r for ct > p, 0 for p > cr > —p, and — t for — p > cr. Thus Do has 
the form (47.34) for all p, not just for p in the infinitesimal neighborhood 
of |cr|: 

£o(9>r) = (4rpc)“ 1 [5(p — cr) — 5(p + cr)], p = |r — r'[ (48.39) 
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It then follows from (48.37) and (48.38) that 

[E.(r,0> E«'(r',i')] 

= -4W»c* | 57 J Z>o(r f) (48.40) 

The commutator bracket for two cartesian components of the magnetic 
field strength can be found in the same way, and is 

[HaM, HXr',0] = [E fl (r A W/)] (48.41) 

An expression for the commutation relation between components of 
E and H can also be obtained from (48.3(1). 

[E.(r,<), H,(r'/)] 

= L~ z ^ Airihck (eti, s 6 t 2 ,a' — sin (k • g — Zccr) (48.42) 

k 

It is apparent that (48.42) vanishes if s = s', so that parallel components 
of the electric and magnetic field strengths commute at all space-time 
points. If Sy* s', we can put s = x, s' = y , where x,y,z form a right* 
handed set of axes, in which case we see that 

6ki.x € k2.v eic2,j:Cki,y — (ckl X £ks)* = 

An analysis similar to that which led from (48.37) to (48.40) then shows 
that 

[E,(rA H„(r'/)1 = -1 tMc ~ - D „(r f) (48.43) 

The subscripts x,y,z can bo permuted cyclically in (48.43). 

Since all these commutation relations involve the Do function (48.39), 
we see that all components of the field strengths commute except in 
the infinitesimal neighborhood of the light cone c(t — t') = ±|r — r'|. 
Thus the field strengths at space-time points so situated that a light sig¬ 
nal cannot pass from either one to (die other, commute with each other 
and can both be measured precisely. This shows that the quantized 
electromagnetic field is propagated with the classical speed of light c. 
A discussion of the connection between these commutation relations and 
the uncertainty principle has been given by Bohr and Roscnfeld . 1 

49. INTERACTION BETWEEN ELECTRONS AND THE 
ELECTROMAGNETIC FIELD 

The quantization of Dirac’s relativistic wave equation for free elec¬ 
trons was carried through in Sec. 47. We jdnill now combine this freo- 

1 N. Bohr and L. Itouenfeld, Det Kgl. Danske Vidennk. Siiukab, MaL-Jys, Mciid,, 12, 
8 (1933). 
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electron theory with the quantum theory of the electromagnetic field in 
vacuum, given in Sec. 48. The resulting formalism is called quantum 
electrodynamics; it provides a description of the interaction between elec¬ 
trons and the electromagnetic field. 

Tjigrangiim and Hamiltonian Equations. We wish first to obtain a 
Lagrangian whose variation yields the earlier equations of motion for the 
electron field and the electromagnetic field. Dirac’s electron field equa¬ 
tion is (43.22): 

ih ~ — e<t>\[/ + a • (— ihc grad i/' — eAip) + me = 0 (49.1) 

at 


where e is the charge on the electron, and hence is a negative number. 
Maxwell's equations for the electromagnetic field are given in (35.2): 


curlE + ^ =0 > 


ill 1 dE 4 tt t 
curl H — *7r7 = — J 
c dt c 


div E = 4 irp, div H = 0 


(49.2) 


The electric charge and current densities that go into (49.2) can be 
obtained from equations (43.20): 


p = J = — ecV'*cn/' (49.3) 

We assume for the present that p and J are due entirely to electrons. We 
can verify the electric continuity or conservation equation (35.3) by 
multiplying (49.1) on the left by i/'*, its Hermitian adjoint equation on 
the right by \f/ } and taking the difference of the two results. 

The desired Lagrangian is just the sum of the Lagrangians for the 
separate electron and electromagnetic fields, with the electron operators 
ih(d/dt ) and —ihc grad replaced by ih(d/dt) — and — ihc grad — cA, 
respectively. We thus obtain from (47.1) and (48.3) 


L = j j "ih ~ — C(p}j/ + a • ( — grad — cA)i^ + dr 

+ i / [(; “ + *"* d *)' - < c “ ri A >’] * 

It can be shown that variation of in (49.4) leads to ( 49 . 1 ), variation of 
\f/ leads to the Hermitian adjoint of (49.1), and variation of A leads to 
Eqs. (49.2) when use is made of (49.3) and (48.2) (see Prob. 9 ). 

The Lagrangian (49.4) suffers from the defects noted earlier in con¬ 
nection with (47.1) and (48.3). Since and <j> do not appear in (49.4), 
their canonical momenta cannot be defined, and so \p* and $ must be 
eliminated as coordinate variables from the Hamiltonian theory. As 
before, the momentum canonically conjugate to a component 1/7 of ^ is 
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ihfy, and the momentum canonically conjugate to a component A x of 
A is P» = (47rc)~H(V c )(dA*/d£) + (dcfr/dx)]. The Hamiltonian then 
becomes 


H- f (iW% + -P-%)dr-L 

= / i^*[a • (i^cgrad + eA)\p + e$$ — mc 2 p\l/]dr 

+ J ^2ttc 2 P 2 + gL (curl A) 2 - cP ■ grad 0 j dr (49.6) 


and appears as a canonical momentum variable. 

It is not difficult to show that the Hamiltonian equations of motion 
for \[/ and its canonical momentum ihp* are (49.1) and its Hermitian 
adjoint, respectively. The Hamiltonian equations for A and P are 


== = 4ttc 2 P - cgrad <j> 
ot 

— = — -r- curl curl A — 


(49.6) 


Thus if we define E = — 4tcP and H = curl A, as before, we obtain the 
first, second, and fourth of Maxwell’s ecillations (49.2). 

Elimination of <£. The third of Maxwell’s equations (49.2) must be 
obtained as a supplementary condition, as the corresponding equation was 
in Sec. 48. We shall therefore be concerned only with those solutions of 
the Hamiltonian equations for which divE — = 0 at some definite 

time. If then the time derivative of this quantity is zero, the restriction 
is maintained at all times, and the solutions so chosen form a consistent 
and satisfactory set. With the help of the second of Eqs. (49.6) and the 
definition of E, we see that 

(div E — p) = Airr | c div(^*«i^) — —r (t/'*i/0 

ol [_ ot 

This is zero because of (49.3) and the equation of continuity for the elec¬ 
tric charge and current densities. 

We can now see that the two terms on the right side of (49.5) that 
involve <j> cancel each other. The second $ term can be integrated by 
parts to give 

— cjP • grad <f> dr = c[<j) div P dr = — H^pdr 

whicli is equal to and opposite in sign from the first <f> term 

Thus <j> has disappeared from the Hamiltonian, and may be chosen in any 
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convenient way. The choice is made so that when P (or E) is divided into 
8olenoidcd and irrotaiional parts, the latter is expressed entirely in terms 
of <f>. We put 

P = Pi + P 2 

div Pi = 0 (Pi is solenoidal) (49.7) 

curl P 2 = 0 (P 2 is irrotational) 

If now we put P 2 = (Airc)^ 1 grad <j> } we see that the third of Eqs. (49.7) 
is satisfied, and that the first of Eqs. (49.6) becomes 

r)A 

= 4™*P! (49.8) 

It is now possible to have div A = 0 at all times if it is true at any one 
time, since (49.8) and the second of Eqs. (49.7) show that ( d/dt ) div A 
= 0. We therefore choose the gauge so that div A = 0. 

The <t> potential reappears in the Hamiltonian (49.6) through the P 2 
term. The volume integral of P 2 can be written, with the help of the 
expression for P 2 and a partial integration, 

/P 2 dr = JP?dr + J(2Pi + Pi) ■ P 2 dr 

= f -Pldr+ A j (2Pi + P 2 ) ■ grad <t> dr 

= J I>ldr - A- J * div (2Pi + P',)dr 

Now div Pi = 0, and the supplementary condition is that div P 2 = — p/c; 
thus the P 2 term in H becomes 

27rc 2 /P 2 dr = 27rc 2 jP?dr + -J-J <j>pdr (49.9) 

The choice of <£ tells us that 

V 2 <£ = 4tc div P 2 = —47rp 

This equation can be integrated by making use of the Green's function 
(26.15) with h = 0; the result is 

4>(r,t) = f dr' (49.10) 

The Hamiltonian (49.5) can now be rewritten, with the help of (49.9) 
and (49.10), 

H = J ■ (the grad + eA)\[/ — mc 2 /3^]dr 

+ j + i- (curl A) 2 jdr + £ J J (49J1) 
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Here, div Pi = div A = 0, and p(r,i) = e^*(r,i)^(r,2). The last term in 
(49.11) is the internal Coulomb energy of the electric charge distribution 
p(r,£); it results from the elimination of and the irrotational part of P, 
and does not have to be inserted into the theory as a separate assumption. 

The solenoidal field vectors (Pi and A) are usually referred to as the 
transverse part of the electromagnetic field, since the electric and mag¬ 
netic field strengths in the corresponding plane wave solutions are trans¬ 
verse to the direction of propagation, as in Sec. 48. The irrotational 
Coulomb field vector (P 2 ) is called the longitudinal part of the field, since 
Eq. (49.10) shows that the contribution to Pa at one point from an 
infinitesimal element of charge at another point is along the line joining 
the two points. 

Quantization of the Fields. We can now obtain a quantum theory 
of the interacting electron and electromagnetic fields by adopting the 
equation of motion (45.23), the electron field anticommutation relations 
(47.8), and the electromagnetic field commutation relations (48.21) with 
Pi substituted for P. It will also be assumed that all components of \p 
and \p* commute with all components of A and Pi. 

The order of factors like \pf and \pi that are multiplied together in the 
Hamiltonian (49.11) is of course immaterial in the unquantized case. In 
the quantum theory, however, these factors do not commute with each 
other, and the result of a particular calculation will depend on the order 
in which they appear in the Hamiltonian. We shall see at the end of this 
section that a suitable Hamiltonian is obtained if all terms are left as 
they are, except for a change in the integrand of the Coulomb term. This 
integrand in (49.11) contains the term 

4 4 

p(r,/.)p(r\0 = y Y (49.12) 

J “ L Z “ L 

which we replace by 1 

4 4 

y Y (49.13) 

j-1 \Zf\ 

It can be shown with the help of the anticommutation relations (47.8) 
that (49.12) is equal to (49.13) plus 

4 

^ ^(r > 0^/(r',«)«(r - r') 
i-1 


1 Note that both (49.12) and (49.13) arc Hermitian. 



378 


QUANTUM MECHANICS [Chap. XIV 


Thus the change from (49.12) to (49.13) is equivalent to the subtraction 
from the Hamiltonian (49.11) of the quantity 


f f ^*(r,M(rV)5(r 

2 J J 


r') 

^ drdr' 


(49.14) 


This is evidently infinite unless 0*(r,£)0(rV) is zero; it will be shown below 
that this can happen only if there are no electrons in the field. 

The quantum equations of motion are obtained from (45.23), when 
the Hamiltonian (49.11) is modified by substitution of (49.13) for (49.12). 
The electromagnetic field equations agree with those obtained in Sec. 
48, except that Pi replaces P and the electric-current term appears as in 
the second of Eqs. (49.6). The electron field equation is the same as 
(49.1), except that <f> is replaced by 


e f 

It can then be shown that the time derivatives of the bracket expressions 
in (47.8) and (48.21) are zero, so that these equations hold for all time if 
they are valid initially, as was assumed (see Prob. 11). 

Inclusion of Static Fields. Thus far we have assumed that the 
electric charge and current densities arise entirely from the electrons that 
are described by the Dirac 0 field. The effect of a static charge distribu¬ 
tion can easily be included, by the addition of a term 4rp a to the right 
side of the third of Eqs. (49.2), and a term —e0 fl 0 to the left side of (49.1), 
where V 2 0 a = —47r p a . It is not difficult to see that the only effect on 
the Hamiltonian (49.11) is to add a term Je0 g 0*0dr. 

The situation of greatest practical interest is that for which 1 


= — Ze/r. 

This corresponds to a fixed (infinitely massive) point nucleus of atomic 
number Z, placed at the origin. With this addition, and the modifica¬ 
tion (49.13), the Hamiltonian (49.11) becomes 

H = j 0* ■ (ihc grad + eA)0 — 0 — 

+ j [ 2 ™^ + J- (curl A)*]dr + £ J J ^ dTdr ' ( 49 .15) 

jl 

where the primes indicate that the arguments are r' rather than r. 

Use of Perturbation Theory. It is natural to attempt to find the 
eigenvalues of the Hamiltonian (49.15), which would be the energy levels 
of the system of electrons, electromagnetic field, and nuclear Coulomb 
1 e ia the electronic charge, and hence negative. 
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field. All such attempts have failed, and there is reason to believe that 
these eigenvalues do not exist: that it is impossible to diagonalize this 
Hamiltonian. This reason derives from a perturbation approach that is 
based on the smallness of e. If e is set equal to zero, (49.15) becomes 
simply the sum of the free-electron Hamiltonian (47.6) and the vacuum 
electromagnetic-field Hamiltonian (48.7). These Hamiltonians have 
already been diagonalized; the eigenvalues correspond to solutions in 
which there are definite numbers of free electrons and of light quanta, 
with no interaction between any of them. 

For a finite value of e, neither the nuclear term of order Ze 2 , nor the 
interelectron Coulomb interaction of order e 2 , causes a fundamental 
difficulty. As we show below, the latter term would give an infinite 
electrostatic or longitudinal self-enei'gy (just as in the classical theory of 
point charges), were it not for the somewhat arbitrary substitution of 
(49.13) for (49.12). A more serious trouble arises from the ea ■ A term, 
which couples the electrons to the transverse electromagnetic field. This 
term is responsible for all interactions between electrons and light quanta, 
and will be used in the next section for the calculation of the emission 
and absorption of light by an atom. One of the effects that the ea • A 
term produces is an infinite transverse self-energy of a free electron, due to 
the virtual emission and reabsorption of light quanta by it; 1 we shall 
ignore this effect in what follows. 2 

We shall work mainly with the perturbation approach. In the 
remainder of this section we consider the matrix elements of the inter¬ 
electron Coulomb interaction, and ignore the transverse electromagnetic 
field. This example is of interest since it shows how the infinite electro¬ 
static Bell-energy is eliminated, and how the exchange interaction between 
electrons (which obey Fermi-Dirac statistics and are described by anti¬ 
symmetric many-particle wave functions) comes out of the quantized 
field theory. The perturbation treatment of the eat • A term will be 
taken up in See. 50. 

Matrix Elements of the Coulomb Interaction. We now consider the 
Hamiltonian (49.15) with the transverse electromagnetic field omitted: 


H 


= J \p* ^ ihc ol 


Zc 2 

grad \{/ — ■— — me W 


■) 


dr 


+ 


rZ/XW**' (49 - 16) 


1 See Hoitlor, op. rit. } Sec. 1S.2. 

2 A relativistically invariant way of subtracting out infinities of this type has 
recently boon developed by H, Tomonaga, Progress of Thear. Phytics (Kyoto), 1, 
27 (1946), Phys. 74, 221 (19-18), and by J. Schwinger, Phys. Rev., 73, 416 (1948). 
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This turns out . to be an adequate approximation when the speed of 
the electrons is small in comparison with the speed of light, since then 
light quanta are not likely to be emitted. In this case, it is possible to 
use the nonrelativistic Schrodinger equation (with spin included) to 
describe the electrons, although we shall continue to use the Dirac 
equation. 

There is a complete orthonormal set of energy eigenfunctions for a 
single electron in a Coulomb field, that we denote by w,-(n,r): 1 


J ^ Wjintfwjin' ,r)dr = 5„ 


?<■ 


ihc&ji • grad 


Ze 2 


’) 


(49.17) 


5,1 — mc 2 fai j Wi(n } T ) = E n Wj(n,T) 


These can become the free-electron solutions (47.11) when Z is set equal 
to zero. We expand \j/ and in terms of the w ’s as in (47.15): 


&(r.O = £ b(n,t)Wj(n, r) 

n 

= £ b*(n,t)Wj(n,T) 


(49.18) 


where , the b’s are operators that obey anticommutation relations like 
(47.16): 

[b(n,t), b(n',t)]+ = [6*(n,0, b*(n',t)]+ = 0 . 1Q . 

[b(n,t), b*{n',t)]+ = S» B ' i J 

Substitution of (49.18) into (47.10) gives, with the help of the ortho¬ 
normality relation in (49.17), 

N = j Wdr = £ b*(n,t)b{n,t) = ^ N n , N n = b*(n,t)b(n,t) 

n n 

In similar fashion, the first term in the Hamiltonian (49.16) becomes 

^ b*(n,t)b(n,t)E n = ^ N n E„ (49.20) 

n n 

In any calculation in which the second term in (49.16) is regarded as a 
perturbation on the first term, we are interested in finding the matrix 
elements of the second term in the representation (specified by the ft’s 


1 This set includes positive- and negative-energy continuum states as well as 
> bound states; it is made discrete by adopting periodic boundary conditions. The 
index n includes specification of the spin. 
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and w’s) that diagonalizes the first term. Substitution of (49.18) into 
the Coulomb-interaction term gives 



b*(nj)b*(n\t)b(n" jt)b(?i" f 9 t) 


• ^ J f |r — T , \~ l ibi(n 9 r)iDi(n r i T , )wi( k n" 

Ji J 


(49.21) 


We wish to find the matrix element of the operator (49.21) that corre¬ 
sponds to any pair of unperturbed wave functionals (see Sec. 46). Since 
electrons obey the exclusion principle, these wave functionals can be 
specified by stating which of the single-electron states n are occupied. 
Now, the b’& are destruction operators, and the b*’s are creation operators. 
Since two of each appear in each term of (49.21), this operator has non- 
vanishing matrix elements only for wave functionals that correspond to 
the same total number of electrons, and for which not more than two 
electrons are assigned to different states in the two wave functionals. 
Moreover, since the two destruction operators appear to the right of the 
two creation operators in each term of (49.21), the only non vanishing 
matrix elements are for wave functionals that correspond to the presence 
of two or more electrons. 

We see then that (49.21) is a null operator so far as a single electron is 
concerned, so that, the infinite longitudinal self-energy of single electrons 
has been eliminated. We now show that this elimination was effected 
by the replacement of (49.12) by (49.13). The difference energy (49.14) 
can be written 


|- ^ b*(n 9 t)b(n',l) ^ j J |r - — r')drdr' 

nn' j 

(49.22) 

Since b(n',t) destroys an electron in the state n', and fr*(ft,£) creates an 
electron in the state n, the diagonal matrix elements of (49.22) are those 
for which n' = n. Then b*(7i,t)b(n',1) can be replaced by N n 8 nn ', and 
there is a positively infinite contribution to the expectation value of 
(49.22) for each electron that is present. Thus the substitution of (49.13) 
for (49.12) was equivalent to the subtraction of the infinite electrostatic 
self-energy of each electron from the Hamiltonian. 

The expectation value or diagonal matrix element of (49.21) for a 
wave functional that corresponds to two or more electrons is a sum of 
terms, each of which involves a pair of occupied electron states. The 
term that refers to the states 1 and 2, for example, contains the matrix 



382 


QUANTUM MECHANICS 


[Craj>. XIV 


elements of four combinations of 6's multiplied by appropriate volume 
integrals; these four combinations are abbreviated as b^b^bzbi, 
b*b*bib 2 , and b%b*bib 2 . The anticommutation relations (49.19) show 
that the second and third of these are equal to each other and opposite in 
sign to the first and fourth. We can also see from (49.19) that (compare 
with Prob. 12) 

**(1,1, - - 0&ittMi¥(l,l, • ■ 0 

= ¥*(1,1, . . .)N itf*¥(l,l, . . .) = +1 (49.23) 

Equation (49.23) then shows that the part of the expectation value of 
(49.21) that refers to the occupied states 1 and 2 is 

«* / / Ir - r'l- 1 T K-(l,r)| 2 T M2,r')|’dr<Zr' 

7 1 

/ / |r - r'l— T ,r)i^(2,r) ^ w)z(2,r')^(l,r')drdr' (49.24) 

The second integral in (49.24) is called the exchange energy , and also 
appears when the expectation value of the Coulomb interaction is calcu¬ 
lated for an antisymmetric many-electron wave function like (32.7). 

60. RADIATION THEORY 

The quantum theory of radiation consists of the perturbation treat¬ 
ment of the ea. • A term, and usually also the Ze 2 /r term, in the Hamiltonian 
(49.15). Calculations have been carried through for several processes of 
physical interest that are of various orders in the electronic charge e. 1 
In most cases it is unnecessary to use the quantized Dirac field that is 
implied by (49.15), since only one electron at a time is involved and the 
theory of Sec. 43 is adequate. 

In this section we consider only the simplest processes of emission and 
absorption of light by an atom, and continue to use the complete quan¬ 
tized field theory. We assume that the nuclear Coulomb energy and the 
interelectron interaction energy can be combined into an effective atomic 
potential energy V(r) of the Hartree type for the electron under con¬ 
sideration (see Sec. 38). The resulting formulas agree with those 
obtained in Chap. X by means of a semiclassical treatment. At the end 
of this section, we show how quantum electrodynamics provides a quanti¬ 
tative explanation of the diffraction experiment discussed in Sec. 2. 
Both the wave-like properties of radiation (appearance of a diffraction 
pattern) and the particle-like properties (ionization of an atom by absorp¬ 
tion of a light quantum) result from Eq. (49.15) or (50.2). 

1 Details of some of these calculations and references to the original papers are 
given by Heitler, op. dt. 
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Formulation in Terms of Transition Probabilities. The development 
of quantized field theory in the last two chapters has been entirely from 
the point of view of Heisenberg’s form of the equations of motion of the 
field variables (see Sec. 23). This formulation centers attention on the 
dynamical variables rather than on the states of the system. Now, how¬ 
ever, we wish to calculate transition probabilities between states of the 
system of electrons and electromagnetic field, and in this way obtain 
expressions for the rates of emission and absorption of light quanta by an 
atom. This suggests that we make use of the time-dependent perturba¬ 
tion theory of Sec. 29, which was in fact first developed by Dirac in con¬ 
nection with the present problem. 1 

The states are represented by wave functionals that can be taken to 
satisfy the Schrodinger-type equations 


ih>Jr = Irt' 
-ihAf* = St'*// 


(50.1) 


where H is given by (49.15). It can then be shown that the dependence 
on time of the matrices that represent dynamical variables such as ^ and A 
arises from the change in time of the wave functionals 'I' that are used to 
calculate the matrix elements (see Prob. 10, Chap. XIII). Thus the 
time dependence is transferred from the dynamical variables to the wave 
functionals; the former will now be regarded as operators whose structures 
are independent of L 

Before proceeding further, we simplify the Hamiltonian by approxi¬ 
mately replacing the effect of the nucleus and all the other electrons on a 
particular electron by a llartree-typc potential energy E(r). With this 
change, Eq. (49.15) becomes 


II - //„ + IV 

Hu = J ■ grad \p -f V(r)\p — mr 2 /ty]dT 

+ J J^TTfrP 2 ! + 4 (curl A) 2 dr 
w = . Aipdr 


(50.2) 


The unperturbed Hamiltonian // () can bo rewritten by expanding A in 
plane waves and in eigenfunctions of Eq. (49.17), where the nuclear 
potential —Zc 2 /r is replaced by V(r). We obtain from (49.20) and 
(48.32) 


1 P. A. M. Dime, I’ror. Hon. Soc., A112, Mil (102(i), Hoc. 5; A114, 243 (1927). 
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H 0 = 2 NJE n + ^ + *) 

n kX 

Nn = &*&„, iVkX = gjk O k „*O k x 


(50.3) 


The commutation relations are given by (49.19) and (48.28): 

[bnM + = K,b n ,*]+ = 0, [b n ,b n '*] + = S nn ' 

2 irhc 

[akx,ak'v] = [a%x,aVx'] = 0, [akx,a*k'x'] = —t— ^kk'^xx' 


The a’a and b’& commute with each other. 

The perturbing energy H ' in (50.2) can also be rewritten by making 
use of the expansions (49.18) and (48.35). We can choose t to be an 
arbitrary constant (say t = 0) in these expressions, since the time depend¬ 
ence of the field variables now arises from the wave functionals. The 
result is 


H‘ = eL~i £ b«ty(n,r)*x • a* 

nn'kX jl 

■ [otkx exp (ik • r) + akx* exp (—ik * r)]b n 'Wi(n' ,t)<It (50.4) 

We specify the unperturbed wave functionals by giving the quantum 
numbers n of the occupied electron states and the number of light quanta 
?ik\ in each plane wave state of the electromagnetic field: 

N ^ __ f ^ if the state n is occupied 

| 0 if the state n is empty (50.5) 

= ™kX* 

It then follows from (50.1) (with H replaced by H 0 ) together with (50.3) 
and (50.5), that ^ oscillates in time with a frequency that is determined 
by the sum of the energies of the electrons and light quanta present. If 
we omit the infinite zero-point energy of the electromagnetic field, we see 
that SF has the time dependence 

exp ^ E n + ^ Actekx^j (50.6) 

n kX 

where the prime denotes a summation over only the occupied electron 
states n. 

We are now in a position to apply the time-dependent perturbation 
theory of Sec. 29. The form of (50.6) shows that only transitions that 
conserve the total energy of electrons and light quanta take place. We 
require the matrix elements of H' in the representation in which H Q is 
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diagonal. These can be obtained by operating with (50.4) on the wave 
functionals ^ in accordance with Eqs. (48.34) for the a’s and equations 
like (46.32) for the b’ s, and making use of the orthonormality property 
expressed by the first of Eqs. (46.20). 

Matrix Elements of the Perturbation. The perturbation H' given by 
(50.4) is a sum of terms, each of which contains one 6, one b*, and either 
an a or an a*. Thus the effect of H' on a wave functional is to destroy 
an electron in one state, create an electron in the same or a different state, 
and either destroy or create a light quantum. This corresponds physi¬ 
cally to the transition of an electron from one state in the potential 7(r) to 
another, accompanied by the absorption or emission of a light quantum. 

We consider first a transition in which a light quantum is absorbed. 
The initial wave functional 'kie~^ Lt represents a state of the entire system 
in which there are n kX light quanta in the state k,X, an electron in the state 
n', and perhaps other light quanta and electrons that do not take part 
in the transition. The final wave functional SEr 2 e - ’ w,t represents a state in 
which there are n kx — 1 light quanta in the state k,\, an electron in the 
state n instead of the state n', and whatever other light quanta and elec¬ 
trons that were present in the initial state. The transition frequency 
o 2 i s ci )2 - oji is given by 

hu 2 ! = E n — E n f — hck (50.7) 

The matrix element of II' for this transition is 1 




21 



r)(exp ik • r)eic X 


anwi(n f ,r)dr 


(50.8) 


We can approximate to (50.8) in the nonrelativistic limit by replacing 
the Dirac single-electron wave functions tiv(n,r) by Schrodinger wave 
functions w n (r). We then see from Eq. (43.21) and Prob. 6, Chap. XII, 
that the velocity operator is — ca, and that this can be replaced by 
(—ih/m )grad in the nonrelativistic limit. The matrix element (50.8) 
then becomes 

H' n = ^ /,-« ( 2 ^V)‘ J w) n (r)(exp ik • r> kX • grad «v(r)dr (50.9) 

The factor exp ik ■ r can be placed on either side of the gradient operator, 
since only the component of grad along e kx comes in, and this is per¬ 
pendicular to k. 


ir The sign factor 0„0 B ', which arises from Eqs. (46.32), can be ignored in the 
remainder of thiH section. 
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Transition Probability for Absorption. We shall make use of the 
nonrelativistic approximation in the subsequent development, since the 
results obtained are then directly comparable with those of Chap. X. 
Equation (29.9) gives the amplitude of a particular wave functional at 
the time t. From this and Eq. (50.9) we obtain the total transition 
probability per unit time for absorption of a light quantum in any of the 
states k,X: 


1 V 4|ffaiI 2 sin 2 Igjrf 

t Z/ h 2 


8 ire 2 hn-^\ 

Zj m 2 o)L z 

kX 


/*• 


(exp ik • r)ctx * grad w dr 


sin 2 jco 2 i t 


(50.10) 


where we have pub co = he. 

We now suppose, as in the work of Sec. 35, that the states n and n' are 
discrete, and that the incident radiation covers a range of angular fre¬ 
quencies co in the neighborhood of ( E n — E n *)/h. The radiation can 
then be described by an intensity I(co)dco for the infinitesimal frequency 
range dco. It is convenient to replace the summation over states k,\ of 
the radiation field by an integration over co. Each quantum contributes 
an amount ho)/L z to the energy density, or an amount ftcco/L 8 to the 
intensity. We can therefore replace 

X by / | ) (60.11) 

kx 

The time-dependent factor on the right side of (50.10) has a sharp 
maximum at co 2 i = 0, which because of (50.7) is the same as 

Aco — E n — E n >. 

The other factors are relatively slowly varying; after the substitution 
(50.11) is made, they can be taken outside of the co integral, which 
can be written 

f * sin 2 faut x 

I - -2 n~~ dC021 — ST 

J — « C02it 

Thus the absorption probability (50.10) becomes 

(50.12) 

which agrees with the corresponding expression (35.17) obtained in 
Chap, X, 


4tV/Q 


m“cw 


- 1 / 


w n (exp ik • r)e kx • grad dr\ 
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Transition Probability for Emission. The energy relation (50.7) 
implies that the electronic state n has a higher energy than the state »'. 
We can now find the transition probability per unit time for emission of a 
light quantum when the electron goes from state n to state n\ The 
an* term in (50.4) is involved, and the matrix element analogous to (50.9) 
is 

^ Jj- 8 + 1 ) ^ J tf V [ exp (—{k . r )] E . grad Wn d Tj (50.13) 

where nn is the number of light quanta initially present in the electro¬ 
magnetic field state k,X. A calculation similar to that which led from 
(50.9) to (50.12) gives for the emission probability 

| J iM««P (-* • r)]«» • grad w n dr * 

+ | J lS »'[exp (-ik • r)]e k x • grad w n dr 

kX 

These two terms evidently arise from rikx and from 1, respectively, 
in the factor (nn + 1)* of (50.13). The first term is proportional to the 
intensity of the incident radiation, and agrees with the expression (36.19) 
for the induced emission probability. The second term is independent 
of the intensity of the radiation present initially; we now show that it 
agrees with the spontaneous emission probability obtained in Chap. X. 

Wo can simplify the second term in (50.14) by replacing the summa> 
tion over k with an integral over to or coai. In order to do this, we require 
an expression for the number of states of the electromagnetic field that lie 
in the angular-frequency range to to w + du. It follows from the discus¬ 
sion of periodic, boundary conditions in Sec. 11 [see Eqs. (11.3)] that there 
are ( L/2T)Hk x dk 1 jdk s plane waves with propagation vectors in the range 
dhjdkydk;. Thus if we specify the direction of the propagation vector k 
by the polar angles 6,$ with respect to some fixed set of axes, the number 
of plane waves with angular frequency in the range <2to that have direc¬ 
tions of propagation in the angular range ddd<t> is 

sin °d0d4>du. 

The integration over co in the second term of (50.14) can be carried out as 
before; the result is 

tfv[cxp (—tk * r)]€k\ • grad w n dr 
x 


f r ( 2r V cViu 
Jo Jo Z/ 2 ti- mV 


shi 2 jcoai t 
<4it 


(50.14) 


sin 6ddd<l) (50.15) 
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. -The. expression to the right of the summation in (50.15) is the prob¬ 
ability. per unit time for spontaneous emission of a light quantum with 
propagation vector k in the angular range d8d<t> 1 and polarization X. Thus 
(50.15) is the total spontaneous emission probability associated with the 
transition n —► n\ In order to compare it with the corresponding expres¬ 
sion in Sec. 36, we specialize to the dipole case by replacing exp (—ik • r) 
by unity and grad by — (mw/h) r in the integrand of the matrix element 
[see Eq. (35.20)]: 

2 

sin QdQd<j> (50.16) 
x 

The two directions of polarization for each propagation vector can be 
chosen in any way so long as they are perpendicular to each other and to 
k. If we choose one in the plane of k and the matrix element (r) n ' n , and 
the other perpendicular to this plane, only the former is emitted; then a 
factor sin 2 6 appears in the integrand, where 6 is the angle between k and 
(r) n ' n . Thus the polarization and angular distribution of the emitted 
radiation are in agreement with those found in Sec. 36. The total spon¬ 
taneous dipole emission probability obtained from (50.16) is 


r r* v e 2 * 8 f - 

Jo Jo 4 2 tt^ 3 ** J 


/TWndrl 


r/r “■ * (/ w n 'Xw n dT • J Wn'TWndr'j sin dddd# 


4e 2 o j 8 

3 *? 


l(r)n'n| 2 (50.17) 


which agrees with (36.22). 

Both the spontaneous and induced emission probabilities appear as 
the result of a single calculation in quantum electrodynamics, while these 
two effects were calculated in quite different ways in Chap. X. As was 
pointed out above, the spontaneous emission arises from the 1 in the 
factor (Wkx + 1)* that appears in the second of Eqs. (48.34). This in 
turn comes from the commutation relations (48.28), and hence is a 
purely quantum-mechanical effect. From a formal point of view, we can 
say that the spontaneous emission probability is equal to the probability 
of emission that would be induced by the presence of one quantum in 
each state of the electromagnetic field (see Prob. 4, Chap. X). Now we 
have already seen in Eq. (48.32) that the smallest possible energy of the 
field corresponds to the presence of one-half quantum per state. This 
suggests that we regard the spontaneous emission as being induced by 
the zero-point oscillations of the electromagnetic field; note, however, 
that these oscillations are twice as effective in producing emissive transi¬ 
tions as are actual quanta, and are incapable of producing absorptive 
transitions. 
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Analysis of a Diffraction Experiment. As our final example, we con¬ 
sider the diffraction experiment that was discussed in Sec. 2. 1 It is 
desirable to simplify the calculation as much as possible, and we start by 
reducing the arrangement of Fig. 1 to its essentials. These essentials 
consist of a source of light S, a diaphragm A with two slits cut in it, and a 
detector of light that can be placed at various points in the plane B . 
The source is taken to be a single atom that is in an excited state and can 
radiate a light quantum. The detector is taken to be a different kind of 
atom in its ground state, which becomes ionized if it absorbs the light 
quantum given off by the source atom (photoelectric effect). The 
diaphragm is assumed to be made of perfectly reflecting material; its 
atomic structure is ignored, and it simply imposes certain boundary 
conditions on the electromagnetic field. 

The physical process in which we are interested is that in which the 
initially excited source atom emits a light quantum, and the detector 
atom absorbs the quantum and is ionized. Actually, however, the light 
quantum cannot be observed in transit unless an arrangement like that 
of Fig. 2 is used, and we are not interested in such an experiment. What 
we wish to calculate is the transition probability from a state in which the 
source atom is excited, the detector atom is in its ground state, and no 
quanta are present, to a state in which the source atom is in its ground 
state, the detector atom is ionized, and no quanta are present. This 
tells us how the probability of observing ionization depends on position in 
the plane B when a light source is placed at S, We shall see that this 
probability is proportional to the intensity calculated by classical electro¬ 
magnetic theory when a light source is at S . In this way, quantum 
electrodynamics predicts both the diffraction pattern that is characteristic 
of light waves, and the ejected photoelectrons that are characteristic of 
light quanta. 

The form of the Hamiltonian [Eqs. (50.2) and (50.3)] tells us that elec¬ 
trons on different atoms interact with each other only through the cou¬ 
pling between each electron and the electromagnetic field. Thus the 
process with which we are concerned occurs only in second order of the 
perturbation energy IV of (50.2). 2 Since no quanta are present initially 
or finally, the intermediate states are those for which a single quantum is 
present, and either both atoms are in their ground states or the source 

1 See also G. Hacah, Accad. Lincei Rmd. t 11, 837, 1100 (1030); W. Heisenberg, 
Ann. d . Phyxik, 9, 338 (1031); E. Fermi, op. cii. } Sec. 10. 

a We assume that the spontaneous radiation probability of the source atom is small 
enough so that, tho remarks of footnote 1, page 192, are applicable. Then the over-all 
transition probability per unit, time is constant for times that are long enough so that 
energy is conserved between initial and final states. 
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atom is excited and the detector atom is ionized. The first type of 
intermediate state corresponds to a double process in which the source 
atom makes a transition to its ground state and emits a light quantum, 
and the detector atom makes a transition to its ionized state and absorbs 
this quantum. Since the intermediate state need not conserve energy, 
the light-quantum energy does not have to equal the initial excitation 
energy of the source atom (see Sec. 29). The second type of intermediate 
state corresponds to a double process in which the detector atom makes a 
transition to its ionized state and emits a light quantum, and the source 
atom makes a transition to its ground state and absorbs this quantum. 
In this case, it is apparent that the intermediate state cannot conserve 
energy. The second-order matrix element for the over-all process is 
obtained by summing expressions like (29.20) over all possible intermedi¬ 
ate light-quantum states of both types. 

The structure of the calculation that follows does not depend on the 
particular experimental arrangement of Fig. 1. The actual diffraction 
pattern is not found explicitly; instead it is shown that the quantum and 
classical results agree for any arrangement of perfectly reflecting dia¬ 
phragms, with or without slits. This conclusion is not surprising, since 
Maxwell's equations have the same form in classical and quantum electro¬ 
dynamics. Nevertheless, it is interesting for us to see explicitly in what 
way agreement is obtained. It is shown below that the summation over 
intermediate light-quantum states gives an expression that is equivalent 
to the point-source solution (Green's function) of the electromagnetic 
wave equation. 

Representation of the Electromagnetic Field. The expansion of the 
electromagnetic field in plane waves that was introduced in Sec. 48 and 
used earlier in this section is not suitable for the present problem, since 
plane waves do not satisfy the proper boundary conditions at the surface 
of the diaphragm. The functions that do satisfy these boundary con¬ 
ditions are quite complicated, and we do not attempt to find explicit 
expressions for them- We simply assume that they exist and form a 
complete set in terms of which the vector potential can be expanded. 
The assumption that the surface of the diaphragm is perfectly reflecting 
means in the general case that these functions have to be real. We make 
the set discrete by placing the entire system in a large but finite closed 
box that has perfectly reflecting walls. The cartesian components of 
the vector functions u*(r) satisfy the second-order wave equation 

V 2 ^ a + Uk* = 0, s = x,y } z (60.18) 
c 

where div u* = 0. Since the tangential components of the electric field 
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and the vector potential vanish at a perfectly reflecting surface, the 
boundary condition is 

n X Ufc = 0 (50.19) 

where n is the vector normal to the surface of the diaphragm or to the 
walls of the box. 

We now show that two of these functions that correspond to different 
eigenvalues are orthogonal to each other. We multiply (50.18) by 
Uk'a( r), the corresponding equation for u k ^(r) by u kB ( r), take the difference 
of the two results and sum over s = x,y,z. The result can be written, 
with the help of Green’s theorem, 

/ X ^ *£■') iA - / u *' ■ ■“* (60 - 20) 

where ( d/dn ) denotes the component of the gradient in the direction of 
the outward normal. The surface integral on the left extends over both 
the diaphragm and the walls of the box. According to (50.19), the 
tangential components of u* vanish on the boundary surfaces; this implies 
that the tangential derivatives of these components are also zero. Then 
since div u k = 0, the normal derivative of the normal component of 
u fc vanishes. It follows that at the bounding surfaces, u* is a vector 
perpendicular to the surface, and du k /dn is a vector parallel to the sur¬ 
face, so that their scalar product is zero. Thus the surface integral on the 
left side of (50.20) vanishes, and JW ■ n k dr = 0 if w 7* Any degen¬ 
erate solutions of (50.18) can bo chosen orthogonal to each other, and all 
solutions can be normalized in the entire region, so we can put 

Ju/,' * \x h (lr = 8 kk > (50.21) 

We now proceed as in Sec. 48 to expand A and P for the vacuum field 
in terms of the functions u k : 

A(r,/) = Y q k (t) u*(r), P(r,() = ^ p k (t) u*(r) (60.22) 

k k 

where q k and p k are Ilermitian and satisfy the commutation relations 

[<fa(0, Pfc'W] = (KO 

mo, moi = mo, pm = o ^ } 

Substitution of (50.22) info the electromagnetic field Hamiltonian (48,7) 
gives, with the help of (50.21), 
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The integral on the right can be simplified by means of a partial inte¬ 
gration : 

/(curl u*) • (curl u ( )dr = /u* • curl curl Uj dr 

where the surface integral vanishes because of the boundary conditions. 
This integral can be simplified further if Uj is expressed in cartesian coordi¬ 
nates and use is made of Eqs. (50.18) and (50.21): 

/(curl Uft) • (curl Ui)dr = — /u* • V*Ujdr 

~ ~2 ! J u * ' n ‘ dr = s *i 

The field Hamiltonian then becomes 

H, m = V (%rcY h + ^ qlj (50.24) 

k 

The quantum equations of motion for g* and pk are obtained from 
(45.23), (50.23), and (50.24): 

a ; 2 

q k = 4 TC*Vk, p k = - jjjJj q k 

These are readily integrated to give 

q k = a#-™' 1 + ate™' 1 

Ph = - ~ ate'"' 1 ) 


where a k and at are operators that do not depend on the time. It is 
easily verified that the a 1 s satisfy commutation relations like (48.28): 




2irhc 2 


5hk f 


with all other pairs commuting. Thus we can identify a k and a* with 
destruction and creation operators, respectively, for quanta in the state 
Jc of the electromagnetic field. The Hamiltonian (50.24) becomes 

H m = ^ + i), N k = ^ o*o* 

k 

This is in agreement with (48.32) since co* corresponds to the quantity ck 
in Sec. 48. 

Matrix Elements. The second-order matrix element (29.20) can be 
written as the sum of two terms that correspond to the two types of 
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Esi — Ego — hu>k 


(60.25) 


(H’s) oat.i is the matrix element for the transition of the source atom from 
the excited state of energy E 8 \ to the ground state of energy E S o with 
emission of a quantum in the state fc: 1 


(H'a)o*.i = Jjj J w SQ (T')n k (r') ■ grad' w si {x')dr' 

ojt is the matrix element for the transition of the detector atom from 
the ground state of energy E B o to an ionized state of energy E B i with 
absorption of the quantum in the state k : 

^ / ,B *( r ) u *( r ) • g ™ 4 w B o(r)dr 

The second term can similarly be written 



E no — Em — ttttih 


(50.26) 


It is readily verified from the earlier work that (#*)o,i* = (H r s)te.i and 
(H' B ) ikt o = 

The time-dependent perturbation theory of Sec. 29 tells us that the 
transition probability per unit time is appreciable only if energy is con¬ 
served between initial and final states. Thus we we interested in those 
ionized states for which 


Em — Ej jo = E a i — Ea o 


If we call this energy difference hw, the sum of (50,25) and (50.26) can 
be written 

X (50 - 27) 

fl.f?' » X,U,Z k 

The electronic wave functions w that appear in (50.27) are well 
localized around either the source atom or the detector atom. Since we 
are primarily interested in macroscopic observations, we neglect the 

1 As was done earlier in this section, wo change to a representation in which the 
time dependence appears in tho wave functionals rather than in the operators, since 
this lends itself more readily to application of tho time-dependent perturbation theory. 
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spatial extent of these wave functions. The transition probability then 
depends on the positions r' of source and r of detector through the expres¬ 
sion in square brackets. For a large containing box, the summation over 
k is to be replaced by an integration over co*, where the contour C is 
chosen in accordance with Eq. (29.24). We define a density of states 
p(fc), such that p(k)do> k is the number of states of the electromagnetic field 
in the angular-frequency range du h . We can then put 

[ ] = p(k)dm ^ p + iR (50 - 28) 

where P is the principal value of the integral and R is nr times the residue 
of the integrand at the pole = w. It is apparent that P and R are real. 
The transition probability is proportional to the square of the magnitude 
of (60.28), or to 

P 2 + R 2 (50.29) 

Classical Diffraction Pattern. We have now to compare this result 
with the classical expression for the intensity of light produced at r by a 
source at r'. This can be found from a solution of the wave equation 
(35.9) for the vector potential A(r^) that is produced by a current density 

J(r,0: 

(50.30) 

We are interested in a solution of Eq. (50.30) when the source current J 
is small in spatial dimensions and oscillates sinusoidally in the time. It 
will be apparent from what follows that an analytical difficulty is encoun¬ 
tered if the time dependence of J is purely harmonic. We therefore make 
the physically plausible assumption that J represents a damped oscilla¬ 
tor, and later take the limit in which the damping constant is negligibly 
small. We put 

J(r,0 = J(r)e“ 7 * cos o>t 

= ^J(r)[e ( ~ 7_h!w) * + e ( ” 7 ~ iw)/ ] 

Since Eq. (50.30) is linear, its steady-state solutions have the form 

A(r,i!) = ^Atr)^- 7 *^ + -jA(r)e ( ~ 7 ’ -ia,) ‘ (50.31) 

where A(r) satisfies the equation 

V’A(r) + 1 (« + iy)* A(r) = -^ J(r) (50.32) 

c c 

Equation (50.32) can be solved in the same way as was the inhomo¬ 
geneous wave equation (26.7). We expand A(r) in terms of the complete 
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set u*,(r) of real vector functions that satisfy the boundary conditions 
stated earlier: 

A(r) = £ A*u*(r) (50.33) 

k 

The constant expansion coefficients A k may be complex since A(r) is not 
necessarily real [the vector potential A(r,Z) is of course real]. Substitu¬ 
tion of (50.33) into (50.32) gives, with the help of (50.18), 

- 32 ^ A kWl - (t» + i 7 ) 2 ]u*(r) = - y J( r ) 

k 

This can be solved for A k with the help of the orthonormality relation 
(50.21). 

A ‘ - - 4 / -W • JM*' (60.34) 

Substitution of (50.34) into (50.33) gives for a cartesian component of 

A( r) 

A.(r) = j Yf 

The summation in square brackets can be replaced by an integration 
over real values of without difficulty, since for finite 7 the pole of the 
integrand is above the real axis. In the limit 7 0, the pole moves into 

the real axis, and the integral approaches the value computed with the 
contour C of Kq. (29.24). This is just the integral (50.28), so that we 
can replace the square-bracket expression in (50.35) by P+ iR- The 
intensity of light measured at r is proportional to the time average of the 
square of the vector potential (50.31). For a small current source 
located at the point r', this intensity is proportional (in the limit 7 —> 0 ) 
to 

\[(P + ilt)C io>t + (P — z/t! )f! ,u ^]" J Limn hvr. = 4[(P COS Cot ~ R 8111 (tit) 2 ]tim* av* 

- 2(P 2 + 7? 2 ) 

The agreement, between this result and (50.29) shows that the probability 
of finding an ionized atom at a particular point is proportional to the 
classically computed intensity of light at that point. 

Problems 

1. Calculate the com mutator bracket of [curl A(r,01 a and P a (r',t). Use the 
result to hIiow that the equation of motion for the quantity P in a vacuum electro¬ 
magnetic field ih given by the second of I0qn. (48.fi). 


<6o ' 36 > 
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I 

2. Show that if the commutator bracket of two cartesian components of A and P 
is given by 

[A.(xA r(^) 

then A and div P at different space points commute with each other. 

3. Show that the replacement of the last of the commutation relations (48.8) by 
that of Prob. 2 does not affect those commutator brackets that involve the field 
strengths. 

4. Show that div H commutes with the electromagnetic field Hamiltonian (48.15), 
and hence is a constant of the motion. 

6. Carry through the calculations implied in Eqs. (48.16), and show that the 
first two of Maxwell's equations are obtained. 

6. Calculate the commutator bracket of each of the bracket expressions in (48.18) 
and the Hamiltonian (48.22), and show that the bracket expressions are constant in 
time. This means that thes6 commutation laws are consistent with the equations 
of motion. 

7. Compare the D function given in Eqs. (47.30) and (47.33), and the Do function 
given in Eq. (48.39), from a physical point of view. 

8. Derive the expression (48.43) for the commutator bracket of E«(r,0 and 

H,(r',*0. 

9. Obtain the Lagrangian equations of motion for 4 /*, and A from Eq. (49.4). 

10. Find the quantum equation of motion for \f/ when the Hamiltonian is given by 
(49.11) with the substitution of (49.13) for (49.12). Show that it agrees with (49.1) 
if <f> is replaced by ej[«/'*(r',0^(r',i)/lr — r'l] dr'. What result is obtained if the above 
subsitution is not made? 

11. Show that the bracket expressions in (47.8) and (48.21) are constant in time 
for the Hamiltonian of Prob. 10. 

12. Verify Eq. (49.23) by making use of the anticommutation relations (49.19), 
and also by making use of operator equations like (46.32). 

13. Calculate a typical off-diagonal matrix element of the Coulomb interaction 
energy (49.21), and show that it is in agreement with what would be obtained from an 
antisymmetric many-electron wave function like (32.7). 

14. Carry through a calculation like that given in the latter part of Sec. 50, except 
that the diaphragm is omitted and plane waves with periodic boundary conditions are 
used in place of the real vector functions u*(r). Show explicitly that the probability 
of ionization of the detector atom is inversely proportional to the square of its dis¬ 
tance from the source atom. Compare this calculation with that of Sec. 30 which 
dealt with the production of a cloud-chamber track. 
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